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ADDENDA ET CORRIGENDA 

p. 120, 2nd table, 2nd row, for “r 6 —2r 9 ” read “r 6 —3r 9 ”. 


p. 144, para, beginning at line 8 should read 

“If there are p distinct dominant roots A 1 A 2 ... A„ and if k 1 k 2 ... k v are the 
corresponding modal rows, the procedure is as follows. Partition the (p, n) 
matrix { K lf K 2 , ...,k v } in the form [a,/?], where a is a (p, p) submatrix, assumed 
to be non-singular (rearrangement of the rows of u and columns of [a,/?] may be 
necessary to satisfy this condition). In this case the required matrix w is con¬ 
structed in the partitioned form 


K 7] 


V = u(I ~w) = u 



and then 



Evidently v has p zero columns $nd hence p zero latent roots. If rearrangement 
has been required, u must be in the corresponding rearranged form. 

The choice of a non-singular submatrix a is a generalization of the choice of 
a non-zero element k t1 in the elimination of a single dominant root. 

This process is in effect that which is applied in the numerical example on 
p. 330.” 

p. 176, equation (4), denominator of third fraction, for 

“d<»(A,)(A-A,)” read u A r d<W(A r )(A~A r )”. 

p. 262, equation at bottom, interchange first and third matrices on the right- 
hand side. /' 

p. 277. It is to be noted that in the definition of x at line 8, a is used to denote 
a set of parameters. Thereafter a denotes the components of acceleration. 

p. 291, § 9*9. The following is a simple alternative proof of the reality of the 
roots of the determinantal equation J m (z) = 0 when A and E are real and 
symmetrical. 

Let z , k respectively denote any root and its associated modal column, and 
let z 9 k be the corresponding conjugates (see § 1*17). Then l 

zAk = Ek. (1) 

Premultiplication by k' yields 

zTc'Ak = k'Ek, (2) 

and by transposition 

zk'Ak = k'Ek. 

The conjugate relation is 

zk'Ak = k'Ek. (3) 

Comparison of (2) and (3) gives z = z, which shows that z is real. Thus by (1) 
k is real, and by (2) z is positive when the potential energy function is positive 
and definite. 

p. 310, § 10*2 (e), second sentence should read “The principle shows that first 
order errors in the mode yield only second order errors in the frequency as 
calculated by the equation of energy”. 

Also line 10 should read “used, and when U happens to be symmetrical, a 
convenient... 

p. 315, line 9 from bottom, for “Rayleigh’s principle will next be applied” 
read “Since U is symmetrical, the extension of Rayleigh’s principle given in 
§ 10-2 (e) can be applied... 


p. 363, § 12*3, line 4, for “given” read “are given”. 






PREFACE 


The number of published treatises on matrices is not large, and so far 
as we are aware this is the first which develops the subject with special 
reference to its applications to differential equations and classical 
mechanics. The book is written primarily for students of applied 
mathematics who have no previous knowledge of matrices, and we 
hope that it will help to bring about a wider appreciation of the 
conciseness and power of matrices and of their convenience in com¬ 
putation. The general scope of the book is elementary, but occasional 
discussions of advanced questions are not avoided. The sections con¬ 
taining these discussions, which may with advantage be omitted at the 
first reading, are distinguished by an asterisk. 

The first four chapters give an account of those properties of 
matrices which are required later for the applications. Chapters i 
to m introduce the general theory of matrices, while Chapter iv is 
devoted to various numerical processes, such as the reciprocation 
of matrices, the solution of algebraic equations, and the calculation 
of latent roots of matrices by iterative methods. 

The remainder of the book is concerned with applications. Chapters 
v and vi deal in some detail with systems of linear ordinary differential 
equations with constant coefficients, and Chapter vn contains ex¬ 
amples of numerical solutions of systems of linear differential equations 
with variable coefficients. The last six chapters take up the subject 
of mechanics. They include an account of the kinematics and dynamics 
of systems, a separate discussion of motions governed by linear dif¬ 
ferential equations, illustrations of iterative methods of numerical 
solution, and a treatment of simple dynamical systems involving solid 
friction. The part played by friction in the motions of dynamical 
systems is as yet very incompletely understood, and we have con¬ 
sidered it useful to include a very brief description of some experi¬ 
mental tests of the theory. 

A considerable number of worked numerical examples has been 
included. It is our experience that the practical mathematician, 
whose requirements we have mainly considered, is often able to grasp 
the significance of a general algebraic theorem more thoroughly 
when it is illustrated in terms of actual numbers. For examples of 
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applications of dynamical theory we have usually chosen problems 
relating to the oscillations of aeroplanes or aeroplane structures. 
Such problems conveniently illustrate the properties of dissipative 
dynamical systems, and they have a considerable practical importance. 

A word of explanation is necessary in regard to the scheme of 
numbering adopted for paragraphs, equations, tables, and diagrams. 
The fourth paragraph of Chapter i, for example, is denoted by § 1-4. 
The two equations introduced in § 1*4 are numbered (1) and (2), 
but when it is necessary in later paragraphs to refer back to these 
equations they are described, respectively, as equations (1-4-1) and 
(1-4*2). Tables and diagrams are numbered in each paragraph in 
serial order: thus, the two consecutive tables which appear in § 7*13 
are called Tables 7-13-1 and 7-13-2, while the single diagram intro¬ 
duced is Fig. 7-13*1. 

The list of references makes no pretence to be complete, and in the 
case of theorems which are now so well established as to be almost 
classical, historical notices are not attempted. We believe that much 
of the subject-matter—particularly that relating to the applications 
—presents new features and has not appeared before in text-books. 
However, in a field so extensive and so widely explored as the theory 
of matrices, it would be rash to claim complete novelty for any 
particular theorem or method. 

The parts of the book dealing with applications are based very 
largely on various mathematical investigations carried out by us 
during the last seven years for the Aeronautical Research Committee. 
We wish to express our great indebtedness to that Committee and to 
the Executive Committee of the National Physical Laboratory for 
permission to refer to, and expand, a number of unpublished reports, 
and for granting many other facilities in the preparation of the book. 
We wish also to record our appreciation of the care which the Staff of 
the Cambridge University Press has devoted to the printing. 

Our thanks are also due to Miss Sylvia W. Skan of the Aerodynamics 
Department of the National Physical Laboratory for considerable 
assistance in computation and in the reading of proofs. 

R. A. F. 

W. J. D. 

A. R. C. 

March 1938 



CHAPTER I 


FUNDAMENTAL DEFINITIONS AND 
ELEMENTARY PROPERTIES 


1*1. Preliminary Remarks. Matrices are sets of numbers or 
other elements which are arranged in rows and columns as in a double 
entry table and which obey certain rules of addition and multiplication. 
These rules will be explained in §§ 1*3, 1*4. 

Rectangular arrays of numbers are of course very familiar in geo¬ 
metry and physics. For example, an ordinary three-dimensional 
vector is represented by three numbers called its components arranged 
in one row, while the state of stress at a point in a medium can be 
represented by nine numbers arranged in three rows and three columns. 
However, two points must be emphasised in relation to matrices. 
Firstly, the idea of a matrix implies the treatment of its elements taken 
as a whole and in their proper arrangement. Secondly, matrices are 
something more than the mere arrays of their elements, in view of the 
rules for their addition and multiplication. 


1*2. Notation and Principal Types of Matrix, (a) Rectangular 
Matrices. The usual method of representing a matrix is to enclose the 
array of its elements within brackets, and in general square brackets 
are used for this purpose.* For instance, the matrix formed from the 

1 12 0 
5 6 1 

ri 12 

5 6 


array 


is represented by 


a- 


The meaning of other special brackets will be explained later. If a 
matrix contains lengthy numbers or complicated algebraic expressions, 
the elements in the rows can be shown separated by commas to avoid 
confusion. 

The typical element of a matrix such as 


r4 

1 _ 

A 12 

••• A l n 

A 21 

A 22 

... A 2n 

1- 

hx 

3 

M 

^4 m2 

A 


* Some writers employ thick round brackets or double lines. 
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NOTATION 


1-2 


can be denoted by A ip where the suffices i and j are understood to 
range from 1 tom and from 1 to n, respectively. A convenient abbre¬ 
viated notation for the complete matrix is then [A^], but in cases 
where no confusion can arise it is preferable to omit the matrix brackets 
and the suffices altogether and to write the matrix simply as A* 

The letters i, j are generally used in the sense just explained as 
suffices for a typical element of a matrix. Specific elements will 
generally have other suffices, such as m, n, r, s. 

( b) Order. A matrix having m rows and n columns is said to be of 
order m by n. For greater brevity, such a matrix will usually be re¬ 
ferred to as an (m, n) matrix; the bar shows which of the two numbers 
m, n relates to the rows.* 

(c) Column Matrices and Row Matrices. A matrix having m elements 
arranged in a single column—namely, an (m, 1) matrix—will be called 
a column matrix. A column of numbers occupies much vertical space, 
and it is often preferable to adopt the convention that a single row of 
elements enclosed within braces represents a column matrix. For 


instance, 


fo, x 2 ,x 3 } = 



A literal matrix such as the above can be written in the abbreviated 


form {zj. 

In the same way a matrix with only a single row of elements will be 
spoken of as a row matrix, f When it is necessary to write a row matrix 
at length, the usual square brackets will be employed; but the special 
brackets [ J will be used to denote a literal row matrix in the abbreviated 
form. For example, s ^ y%> ya] . 

In accordance with the foregoing conventions, the matrix formed 
from the rth column of an (m, n) matrix [A {j ] is 

{Aj r>^2r, A mr }, 

and this can be represented as {A ir }, provided that i is always taken to 
be the typical suffix and r the specific suffix. In the same way the 
matrix formed from the *th row of [Ay] is 

l^il* •••» ^inl» 


and this can be expressed as [A^\. 


* An altematiTe notation, wfaioh i a in <ramnt use, is [A]J. 

t A row matrix is often o&lled a lint matrix, a vector eftks first kind, or a prime; while a 
column matrix is refe rred to as a rector of toe second kind, or a point. 
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SPECIAL TYPES OF MATRIX 
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The most concise notation for column and row matrices is, as with 
matrices of a general order, by means of single letters. The particular 
type of matrix represented by a single letter will always be dear from 
the context. 

(d) Transposition of Matrices. The transposed A' of a matrix A is 
defined to be the matrix which has rows identical with the columns 
of A. Thus if A = [A {J ], then A' = [A#]. In particular the transposed 
of a column matrix is a row matrix, and vice versa. 

In this book an accent applied to a matrix will always denote the 
transposition of that matrix. 

(e) Square, Diagonal, and Unit Matrices. When the numbers of the 
rows and columns are equal, say n, the matrix is said to be square and 
of order n: the elements of type A u then lie in the principal diagonal. 
If all the elements other than those in the principal diagonal are zero, 
the matrix is called a diagonal matrix. The unit matrix of order n is 
defined to be the diagonal matrix of order n which has units for all its 
principal diagonal elements. It is denoted by I n , or more simply by I 
when the order is apparent.* 

(fj) Symmetrical and Skew Matrices. When A if = A if the matrix A 
is said to be symmetrical, and it is then identical with its transposed. 
If A if = — A jiy whereas the elements of type A u are not all zero, the 
matrix is skew, but if both A i} = —A J{ and A {i = 0 the matrix is skew 
symmetric or alternate. Both symmetrical and skew matrices are 
necessarily square. 

(g) Null Matrices. A matrix of which the elements are all zero is 
called a null matrix, and is represented by 0. 

Examples 

(i) (3,2) Matrix. ("1 2] . 



(ii) Row Matrix. [0, 1, — 3, 0]. 

(iii) Column Matrix. {2, — 1, — 3, 1}. 

(iv) Symmetrical Square Matrix. 



* On the Continent the symbol commonly used for the unit matrix is E. 






1-3. Summation of Matrices and Scalar Multipliers. Opera¬ 
tions with matrices involve operations with the elements of which they 
are composed. Unless the contrary is stated, these elements will always 
be understood to be numbers, real or complex, which obey the laws of 
ordinary algebra (i.e. scalars). It is, however, sometimes useful to 
consider matrices the elements of which are not ordinary numbers. 
For example, the elements may themselves be matrices (see § 1*7). 

(a) Equality of Matrices. Equal matrices are necessarily of the same 
order and have their corresponding elements equal. Thus A = B if 

= By, 

The equality of two matrices of order m by n implies by definition 
the satisfaction of win ordinary equations between their elements. 
Conversely, a set of ordinary equations can always be represented by 
a single equation between matrices. 

(b) Addition and Subtraction of Matrices. These operations can only 
be performed on matrices of the same order. 

The sum of two such matrices A and B is defined to be the matrix C 
the typical element of which is A.jj "I - B^j. Then 

A + B = C. .(1) 

The difference of A and B is similarly the matrix D the typical 
element D {j of which is A^ — B ir Then 

A-B = D. .(2) 

Since any element in a matrix sum is equal to the algebraic sum of the 
corresponding elements in the summed matrices, we see that the addi¬ 
tion of matrices is subject to the same laws as the addition of scalars. 
Thus the associative and commutative laws of addition hold good. 
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1-3 


(c) Scalar Multipliers. If A = U, it is natural to writ© equation (1) 
as 2A = C, with C {j = 2A {j . More generally, the convention is adopted 
that multiplication of a matrix by a scalar coefficient, say Z, written 
either before or after the matrix, is equivalent to multiplication of 
every element by Z. Thus, if 

IA — Al = G , 

then C {J = lA {i . 

The foregoing definitions and conventions are sufficient for the 
interpretation and reduction of any expression which is linear and 
homogeneous in a set of matrices of the same order. 


Examples 

(i) Matrix Equation Expressed as Scalar Equations. The single 


matrix equation 


a n a 12 — ^11 
- a 2l a 22A \p21 


J 12 

&22J 


yields the four scalar equations 

On = 6 U ; a 12 — b 12 ' f a 21 = b 21 \ a 22 = b 22 . 

(ii) Scalar Equations Expressed as Matrix Equation. The four 
scalar equations 

a 1 = bi\ a 2 = b 2 \ a% — b 2 \ a± = 64 

are contained in each of the matrix equations 

\ a i a 2 \ “ pi ^2! > 

[a 3 aj [b z 

[a 1} a 2 , a 3 , a 4 ] = b 2 t b Zi b^\, 

{®1, ®2> ®3> ^4} = {^i> ^2> ^4}* 

(iii) Sum of Matrices . 

ri 2 31 + T4 5 0] = f*5 7 91. 

[2 3 4j [5 6 7J [7 9 llj 


(iv) Difference of Matrices. 

[1 0 — 11 — I"— 1 1 21 = T2 -1 —31. 
[2 -7 3J [ 1 “2 Oj [l -5 3J 


(v) Scalar Multipliers. 

t. n-J-rr: 
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1*4 


1*4. Multiplication of Matrices. With matrix multiplication 
two essential facts must be borne in mind. Firstly, matrices are in 
general not commutative in multiplication. Secondly, two matrices 
can only be multiplied in a given order provided that a certain con¬ 
dition is satisfied. If the number of columns in B is equal to the number 
of rows in A, the two matrices are described as conformable, and they 
can then be multiplied in the order By. A. Specifically, if B is a {q,n ) 
matrix and A is an (», p) matrix, then the product BA is a (g,p) matrix. 
A scheme which expresses this rule very simply is 

(q,n)x(n,p) = (q,p). .(1) 

The product BA is referred to either as A premuUiplied by B, or as 
Bpoetmultiplied by A. 

We may now define the prooess of multiplication. To obtain the tth 
element in the jth column of the product Pm BA, select the tth row 
of B and thejth column of A, and sum the produots of their corre¬ 
sponding elements, beginning at the left-hand end and the top, 
respectively: thus 

'LB ir A ri . .(2) 

r—1 

As a particular case assume in (1) that q = 1 and p = 1, so that the 
first matrix has merely a single row, say [Bj J, of n elements, while the 
second has a single column, say {AJ, of n elements. The product 
[Bj J {A*} in this case is a (1,1) matrix, or a scalar*, which is given by 
(2) as the sum of the products of the corresponding elements in [B s J 
and {A<}. The general process of multiplication of two matrices may 
accordingly be interpreted as follows: To obtain the typical element 
P<i of the product BA, postmultiply the tth row of B by the jth column 
ofA. 

From (1) it is seen that two matrices B and A can be multiplied in 
both the orders BA and AB only provided they are of the types 
(p,n) and (n,p): the produots are then of the types (p,p) and (n ,»), 
respectively. In particular this condition is satisfied for square matrices 
of equal order: however, even in this case the two products are usually 
notthesame. Two matrices having the special property that PA = AB 
are said to commute or to be permutable. The unit matrix J, for instance, 
commutes with any square matrix of the same order. 

* The caution should bo added that, although a (1,1) matrix can always be treated 
as a scalar, the ooiwerse is only true when oonformability allows. 
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Examples 

(i) Product of Rectangular Matrices. 



"(4x2)-(2x3)-(lxl) + (2x3), (4 x 3) + (2 x 0) —(1 x 5) + (2 x 1)' 
(3 x 2) + (7 x 3) + (1 x 1) —(8 x 3), (3 x 3)-(7 x 0) +(1 x 5)-(8 x 1) 
.(2 x 2) - (4 x 3) - (3 x 1) + (1 x 3), (2 x 3) + (4 x 0) - (3 x 5) + (1 x 1). 



The rule (1) here gives (3,4) x (4,2) = (3,2). The matrices are not 
conformable when taken in the reverse order. 

(ii) Products of Square Matrices. 

r 3 4]p 21 = r (3 x l) + (4x 2), (3 x 2) +(4 x 5)1 = T 11, 26]. 

L-2 -lJU 5j [ —(2x 1) —(1 x2), — (2x2) —(1 x5)J [-4,-9j 

When the matrices are multiplied in the reverse order the product is 

IT 21T 3 41 = r(lx3)-(2x2), (lx4)-(2xl)l = r-1 21. 
[2 5JL —2 -1J [(2 x 3) — (5 x 2), (2 x 4) — (5 x 1)J [-4 3j 

Another illustration is provided by the pair of products 

' 3 4 21 ri 1 1] = f 21 21 21], 

-2 -1 -1 2 2 2 -9 -9 -9 

-1 -3 -1jL 5 5 5j [-12 -12 -12 

1 1 1] T 3 4 2] 0. 

2 2 2 -2 -1 -1 

5 5 sj [-1 ~3 -1. 

(iii) Products of Permutable Matrices. 

0 -3 1 ] ro -1 1 ] = ro -1 i] ro -3 n = 0 , 

2-110 1-1 0 1 - 12-11 

.2—1 lj LO 3 — 3J [0 3 — 3j -1 1. 

[a 6]p 0] = ri 0]fa 6] = [a 6]. 

[c dJLo lj [o lJLc d\ [c dj 
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PRODUCTS OF ROWS AND COLUMNS 


1-4 


(iv) Column Matrix Premultiplied by Row Matrix. 


[5 2 —3]{2 -1 4}== [5 2 -3] 


r 2 * 
■1 
4 


= —4. 


The rule (1) here gives (1,3) x (3,1) = (1,1); hence the product is a 
scalar. 


(v) Row Matrix Premultiplied by Column Matrix. 
{2 -1 4}[5 2 -3] = 


’ 2’ 

-1 

[5 

£ -3] = 

" 10 

-5 

4 

— 2 

-6' 

3 

4 



. 20 

8 

-12 


In this case the rule (1) gives (3,1) x (T, 3) = (3,3). Note that the 
product here is of a very special type. The elements in any row (or 
column) are proportional to the corresponding elements in any other 
row (or column), so that the product has in fact only a single linearly 
independent row (or column). Examples (ii) and (iii) contain other 
illustrations of square matrices with this property. More generally, 
an (m, n) matrix with only a single linearly independent row (or column) 
is always expressible as a product of the form {A^ |i^J, where {A^ is a 
column of m elements and [Bj J is a row of n elements. For instance, 
pO, - 5, 20J = pj [2, - 1, 4] = {5, 2} [2, - 1, 4]. 


(vi) Square Matrix Postmultiplied by Column Matrix. 


ax = 


«12 

a i3 


X 1 

= 

^11*^1 ^12*^2 a i3 X 3 


a 21 

a 22 

a 23 


X 2 


G'21 X 1 “b a 22 X 2 "b ®23 X 3 


J 1 31 

a 32 

a 33_ 


*3. 


_ a Zl x l + a Z2 X 2 + a 22 X 2. 


The system of linear algebraic equations 

a ll X l^~ a 12 X 2~^~ a l2 X 2 “ 

® 22 X 2 “^^ 23^3 “ ^ 2 > 

®31^1+ a 32 a '2^“®33* l '3 = ^3 

is accordingly concisely expressible as the matrix equation 

ax = b. 


(vii) Abbreviated Rules for Products of Special Matrices. 

Matrix x column = column, 

Row x matrix = row, 

Row x column = scalar, 

Column x row = matrix with proportional rows and proportional 
columns. 
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1*5. Continued Products of Matrices. A continued product of 
matrices, such as CBA , is to be interpreted as follows. First pre¬ 
multiply A by B, and then premultiply the product BA by C. This 
process will of course not be possible unless B is conformable with A 
and C with BA. 

In the foregoing definition of a continued product a specific order of 
multiplication is laid down. However, it will now be shown that the 
associative law holds good for matrix multiplication, so that the 
factors in a product may be grouped in any convenient manner, 
provided that the order of multiplication is not altered. 

The associative law requires that if Y = CB and X = BA, then 

CBA = YA = CX . .(1) 

To prove formally that this is a consequence of the definitions of § 1-4, 
we note firstly that equation (1*4*2) gives for the typical element of X 

= 2 Bir^rj, 

r=l 

where n is the number of columns in B and of rows in A. Hence the 
(k,j )th element of CX is 

m m n 

S =2 £ C ki B ir A r] , (2) 

i=l i=i r =i 

where m is the number of columns in C. Similarly, since 

Yjcr = 2 C ki B ir , 

1 

the (k,j )th element of YA is 

XCtoB^, (3) 

r=l i—lr—1 

in agreement with (2). This proves the truth of (1). 

In view of the associative law, it will now be clear that a continued 
product, or product chain, such as 

1 ••• 2*^-1 

will only have a meaning provided that adjacent matrices A 8 , A s _ x in 
the chain are conformable. Thus, if r 8 and c 8 denote, respectively, the 
number of rows and columns in A 8 , the conditions to be satisfied are 

c* = r 8 _ i 

for a = m, m — 1,..., 2. The scheme of multiplication in this case may be 
represented by 

(r m ,r m -i) x (f m -v r m - 2 ) x ... x (f 2 , r t ) x (r v c x ) = (f m ,c x ). 

The product, accordingly, is a matrix having r m rows and c x columns. 
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CONTINUED PRODUCTS 


1-5 


For brevity, the product of two equal square matrices A is written 
A % , and a similar notation is adopted for the other positive integral 
powers. 

The distributive law also holds good for matrix multiplication. For 
example, ^ + B)F = EAF+EBF. .(4) 


The correctness of this follows at once from the formula (2) and the 
definition of addition. 


Examplbs 

(i) Associative Law. Compare 

r 3 4] T1 2] x [2 -1] = r 11 26] p - 1] = r 22 67] 
[-2 — lj [2 sj [0 3j [-4 — 9J [0 3j L-8 — 23j 

with 

n-nx-j-u -w 

(ii) Buie for Product. With 

P = {2 -1 4}[6 2 — 3]{1 0 2}[— 1 2].(7) 

the rule for the product gives 

(3,1) x (1,3) x (3,1) x (T, 2) = (3,2). 

The product thus exists, and is a matrix with three rows and two 
columns. To evaluate P, note that the part-product [6 2 — 3] {1 0 2} 
yields the scalar — 1, which may be brought to the front: hence 

P = -1 x{2 -1 4}[ — 1 2] = T 2-4'. 

-1 2 
L 4 -8. 

(iii) Positive Powers of a Square Matrix. 

[j rn au: 

r 3 4] 3 = r 3 4] r i 16] = r 1 ie] r 3 4] = [-29 20]. 

L-2 lj [ — 2 lj L-8 — 7J [-8 -7J L-2 lj |.-10 -39j 
Note that positive integral powers of a square matrix are permutable. 

(iv) Computation from “ Bight to Left". In this method the complete 
product is evaluated by successive premultiplications. For instance, 
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{1 0 2 }[ —1 2 ] = 


r-i 2 - 

o o 

-2 4 


[5 2 -3] 


-1 2 ' 
0 0 
-2 4. 


= [1 - 2 ], 


and finally P = {2 —1 4} [1 —2] 


2 -4 
-1 2 
4 -8 


Equation (6) is a further simple illustration of computation from 
right to left. 

(v) Computation from “Left to Right”. Here successive post¬ 
multiplications are used. Thus, if the complete product is (7), 


{2 -1 4}[5 2 -3] = 


10 

-5 

20 


4 -6 

-2 3 

8-12 


10 

-5 

20 


4 -6 

-2 3 

8-12 


{1 0 2} = {— 2 1 -4}, 


and lastly, P = {-2 1 — 4}[ — 1 2] = 


Equation (5) provides another example. 


2 -4 
-1 2 
4 -8 


(vi) Uae of Subproducts. In this treatment the complete product 
chain is first split into a convenient set of subproducts, which are 
separately computed. The calculation is completed by multiplication 
of the chain of subproducts, or by separation into further subproducts. 
For instance, 

-in: 

= r 11 26] r-9 -26] = 5/ 2 . 

L-4 —9j[ 4 llj 
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DIAGONAL MATRICES 


1 - 5 - 1-6 


Some of the methods of numerical solution of differential equations 
to be described in Chapter vii involve the computation of lengthy 
product chains of square matrices. With such product chains it is often 
preferable to adopt the method of subproducts rather than to compute 
the complete chain directly from left to right, or right to left; the use 
of subproducts facilitates the correction of errors. For instance, sup¬ 
pose the second matrix from the left, namely ["1 21, to have been 

[2 5J 

incorrectly entered as [1 21. The rectification of this error involves 

[2 3j 

a recalculation of the first subproduct but no recalculation of the 
other subproducts. On the other hand, if the chain is computed from 
left to right, the correction is much more troublesome. 

A further important case is that in which a chain of square matrices 
is postmultiplied by a column matrix. For instance, consider the 


product 



Here computation from right to left obviously requires the least 
labour, since at each stage merely the postmultiplication of a square 
matrix by a column matrix is involved. 


1-6. Properties of Diagonal and Unit Matrices. Suppose A 

to be a square matrix of order n , and B to be a diagonal* matrix of the 

same order. If n D . 

P = BA , 


P {j — 2 Bi r A r j — 

r=l 


since in the present case B ir = 0 unless r = i. Hence premultiplication 
by a diagonal matrix has the effect of multiplying every element in any 
row of a matrix by a constant. Similarly, postmultiplication by a 
diagonal matrix results in the multiplication of every element in any 
column of a matrix by a constant. 

Diagonal matrices are clearly permutable with one another, and the 
product is always another diagonal matrix. In the particular case 


where A i{ = B^ 1 , 


BA = AB = I. 


These relations are characteristic of reciprocal or inverse matrices (see 

§ I'll)- 

* For definition of diagonal matrix see § 1*2 (e). 
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From the definition of the unit matrix it follows that if A is any 

square matrix IA = AI = A } .(2) 

and that I m = I. .(3) 

It is on account of the properties expressed by (2) and (3) that the 
name unit matrix is justified. 

A diagonal matrix whose diagonal elements are all equal is oalled a 
scalar matrix. It is equal to the unit matrix multiplied by a scalar. 


Examples 

(i) Premultiplication by Diagonal Matrix. 

■i o oi r 2 o 7i = r2 o 7*1. 

0 -4 0 -1 4 5 4 -16 -20 

0 0 3j L 3 1 2J |9 3 6 

(ii) Postmultiplication by Diagonal Matrix. 

' 2 0 7 ] ri 0 01 = r 2 0 211. 

-1 4 5 0 -4 0 -1 -16 15 

3 1 2J [0 0 3J L3-4 6 

(iii) Commutative Properties. 

1 0 oi r2 0 oi = r2 0 oi ri 0 oi = r 2 0 0‘ 

0 -4 0040 0400 -4 0 0 -16 0 

0 0 3J |.0 0 2J [0 0 2J |0 0 3J L0 0 6. 


(iv) Reciprocal Diagonal Matrices. 


01 

0 

0“ 

10 

0 

ii 

o 

0 

-0-2 

0 

0 

-5 

0 

0 

0 

-4 

. 0 

0 

-0*25 


(y) Scalar Matrices. 


2 

0 

0- 

'3 

0 

0‘ 

0 

-2 

0 

0 

3 

0 

0 

0 

— 2 

.0 

0 

3 


1*7. Partitioning of Matrices into Submatrices. The array of 
a given matrix may be divided into smaller arrays by horizontal and 
vertical lines, and the matrix is then said to be partitioned into sub¬ 
matrices. The partitioning lines are usually shown dotted, as in 


- 3 

0 

2' 

-2 

-1 

-1 

.-1 

-3 

5_ 
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PARTITIONING 


1*7 


Each of the submatrices in a partitioned matrix may be represented 
by a single symbol, and the original matrix becomes a matrix of 
matrices? in this case the partitioning lines will usually be omitted, or 
perhaps indicated by commas. With certain fairly obvious restrictions 
partitioned matrices can be added and multiplied as if the submatrices 
were ordinary matrix elements, and the result of any such operation 
is the same as if the operation were performed on the original un¬ 
partitioned matrices. 

Suppose two matrices of the same order to be partitioned in a corre¬ 
sponding manner. Then the submatrices occupying corresponding 
positions will be of the same order, and may be added. Since each 
element in the sum is the sum of the individual elements, it is clear that 
the sum of the partitioned matrices is equal to the sum of the original 
matrices. 

Multiplication of partitioned matrices requires more detailed con¬ 
sideration. Let BA = P, where B is of type (m,p) and A of type (p, n). 
Suppose now that B is partitioned by, say, two vertical lines into three 
submatrices B v B 2 , B 3 ; thus B == [B lt B % , P 8 ]. Then since B now has 
three (matrix) elements arranged in a line, conformability requires that 
A shall be partitioned into three (matrix) elements A v A % , A a arranged 
ina column; also the products B x A lt B 3 A t , B 3 A 3 must be conformable. 
Hence the p columns of B and the p rows of A must be correspondingly 
partitioned by vertical and horizontal lines, respectively. Clearly the 
values of P given by BA and by B 1 A 1 + B 3 A % + B 3 A 3 will be identical, 
for the partitioning has only the effect of splitting the typical sum 

p 

2 B ir A ri into three portions which are subsequently added. Since 

r—l 

the numbers of rows in B and of columns in A are quite unrelated, 
any horizontal partitioning of B and vertical partitioning of A may 
be quite arbitrary: the conformability of the submatrices will not be 
affected. 

s^in general, for the multiplication of two partitioned matrices, it 
- is necessary and sufficient that for every partitioning line between 
1 columns of the matrix on the left there shall be a partitioning line 
between the corresponding rows of the matrix on the right, and no 
farther horizontal partitioning on the right. 
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(i) Addition. 


Examples 




'2 0 4' 

+ 

Vh 

© 

eo 

-s 

ftr ! ^ ^ «r I ^ 

0 2 7 


0 3-2 


2I * ! 7 3/ * i — 2 

.-3 8-1. 


.8 -3 6. 


L— 3 " 8 ! —iJ is -si eJ 



(ii) Multiplication. 


’ 3 

0 

2 " 

1 

-1 

4 ‘ 

= 

' 31 0 

2 “ 

1 

-1 

4 ' 

— 2 

-1 

-1 

2 

3 

0 


— 2 J — 1 

-1 

2 

3 

0 

.-1 

-3 

5 . 

-6 

0 

2 - 


- — 1 ; — 3 

5 - 

-5 

” 0 ” 

2 - 


= { 3 , - 2 ,- 1 } [ 1 , - 1 , 4 ] + { 0 , - 1 , - 3 } [ 2 , 3 , 0 ] + { 2 , - 1 , 6 } [ 6 , 0 , 2 ] 


as 

3 -3 121 

+ 

© 

© 

© 

+ 

■ 10 0 4 ' 


1 

to 

to 

1 

QO 


-2 -3 0 


-6 0 -2 


L-i 1 


.-6 -9 0 . 


. 26 0 10 . 


-9 -1 
18 -8 


16 ' 

-10 

6 


Another scheme of partitioning for the same product is 


’ 3 

0 ! 

2 ' 

1 

-1 

4 “ 

— 2 

- l ! 

_ _ l 

-1 

2 

3 

0 

.-T 

- 3 ! 

5 . 

-5 

0 

2 . 


Ill -3 

tO H - 

1 

CO H - 

© 

_1 

1-3 

[6 0 2] 

[-1 - 3 ] 

ri -1 4 
[2 3 0 . 

+ 6[6 

0 2] 


r 3-3 121 + r 10 0 4 T 

[ — 4 -1 — 8 j [-6 0 -2J 


‘ 13 -3 16 " 

-9 -1 -10 

[-7 -8 - 4 ] + [26 0 10 ], 


. 18 -8 6 


The convenience of partitioning when the matrices include many zero 



4 



6 2 

© 

© 

1 -2 

0 0 

2 1 

© 

© 

-2 6 

0 0 

0 0 

8 3 

© 

© 

to 

1 

CO 

O 

O 

6 2 . 

Loo 

-6 8 . 


E 3 U 1 ] 


o 

o 


0 

0 


0 

0 


■ 8,3 

.6,2 


0 

0 


2,-3 

- 6 , 8 
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DETERMINANTS 


1-8 


1*8. Determinants of Square Matrices. The determinant of a 
square matrix A is the determinant whose array of elements is identical 
with that of A , and it is represented by | A |. It is shown in treatises 
on determinants that the product of two determinants | B | and | A | 
of the same order m can be represented by another determinant of 
order m . The multiplication rule for determinants can be expressed in 
several different ways, but one of them is as follows: 

|2f|xM|-|(7|, .(1) 

7 n 

if Cy = S BirA r j- 

r-1 

The array of the elements in | C | is thus identical with that in the 
matrix product BA. Hence the determinant of the product of two 
matrices equals the product of their determinants. It should be noted 
particularly that if c is a scalar and A is a square matrix of order ra, 

then \cA\=c m \A\. .(2) 

Square matrices are also associated with minor determinants and 
cofactors. It may be recalled that the first minor of the determinant 
| A |, corresponding to the element A ij} is defined to be the determinant 
obtained by omission of the ith row and the jth column of | A |; while 
the cofactor of A {j is this minor multiplied by (— l) i+ K If | A | is of 
order ra, any first minor is of order m— 1. Similarly, the determinant 
obtained by omission of any a rows and a columns from | A | is called 
an «th minor, or a minor of order m — a. 

Methods for the computation of determinants and cofactors by the 
use of matrices are described in Chapter iv. A few special properties 
of determinants to which reference will be made later are given in the 
examples which follow. 

Examples 


(i) Differentiation of Determinants. Suppose A(A) to denote a deter¬ 
minant of order m, the elements of which are polynomials of a para¬ 
meter A: for instance, 


A(A) = 


Ai(A) /„(A) ... / lm (A) 

/21M /22W ••• AmW 


fmlW /m2(^) ••• fmmW 

Then by the usual rule for the differentiation of determinants 


-yr- IS the 

dX 
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s um o f the determinants obtained when any one column is differentiated 
and the remaining columns are taken as they stand. "Since each "such 
determinant is expansible in terms of first minors of A(A), it follows that 

is a linear homogeneous function of the first minors A x of A(A). 
(La 

d 2 A 

Hence also is a similar function of the second minors A 2 of A(A), 


dP A. 


and more generally is linear in the pth minors A p . 


(ii) Determinants whose Minors of a Certain Order all Vanish. 
Suppose that, when A = X 8 , at least one of the qth minors A q of A(A) 
does not vanish, whereas all the (q— l)th minors A^ do vanish, so 
that A —A s is a factor of every minor \_ v Now, by the theorem 


proved in example (i), 


dA t 


dX 


is linear in the first minors of A q _ 2 —that is. 


in the minors A^ of A(A). Hence A — A s is certainly a factor of 


dA a t 


q-2 


dX 


so that (A — AJ 2 at least is a factor of every minor A q _ 2 . In the same way 

is linear in the minors A 3 _ 2 , and thus contains the factor (A — A s ) 2 . 

Accordingly (A - A s ) 3 at least is a factor of every minor A q _ z . Proceeding 
in this way we see finally that (A — AJ® -1 at least is a factor of every 
first minor A x of A(A), while (A — AJ 5 at least is a factor of A(A) itself. 
For instance, suppose 


A(A) = 


10 0 0 
0 A 0 0 
0 0 A 0 
0 0 0 A 


Here, when A = 0, all the second minors of A(A) are zero, but one third 
min or is not zero. Clearly A 2 is a factor of every first minor, while A 3 
is a factor of A(A). 


Again, if 


A(A) = 


10 0 0 
0 A 0 0 

0 0 A 2 0 
0 0 0 A 3 


we see that as before, when A = 0, all second minors vanish whereas 
one third minor does not. In this case A 3 is a factor of every first minor 
and A® is a factor of A(A). 
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• 1*9. Singular Matrices, Degeneracy, and Rank. In general the 
rows of a square matrix are linearly independent, but examples have 
already been given of matrices which do not satisfy this condition. 
If the determinant of a square matrix vanishes—s6 that the rows are 
not linearly independent—the matrix is said to be singular. 

The rows of a singular matrix may be linearly connected by only a 
single relation, in which case the matrix is said to be simply degenerate , 
or to have degeneracy * 1. If the rows are linearly connected by more 
than a single relation, the matrix is multiply degenerate , and in fact the 
degeneracy is q if there are q such relations. In accordance with this 
definition the rows of a square matrix of order m and degeneracy q will 
all be expressible as linear combinations of m — q linearly distinct rows 
of elements. The quantity m — qi& usually spoken of as the rank of the 
matrix. The preceding considerations are, of course, true equally for 
the rows and the columns. The formal definitions of degeneracy and 
rank are as follows: A square matrix of order m is of degeneracy q when 
at least one of its gth minor determinants does not vanish, whereas 
all its (q— l)th minor determinants do vanish. The rank then is m — q. 

From the preceding remarks it will be clear that the question of the 
degeneracy of a square matrix a of order m is intimately bound up 
with the number of linearly independent sets of quantities x which 
can satisfy the m homogeneous scalar equations contained in the 
matrix equation ax = 0 (see example (vi) of § 1*4). Here x denotes a 
column of m unknowns. When a is non-singular x — 0 is the only 
solution. When a is simply degenerate a non-zero column, say x = x, 
can be found to satisfy the equation, and the most general solution 
then is an arbitrary multiple of x. Again, when a is doubly degenerate 
two distinct non-zero columns can be found, say x(l) and x(2), and 
the most general solution is an arbitrary linear combination of these. 
More generally, when a has degeneracy q there are q linearly inde¬ 
pendent non-zero solutions. 


Examples 


(i) Non-Singular Matrix. The matrix 


-1 0 2 
0 1 OJ 


* Also called nullity by some writers. 
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is non-singular, since | a | + 0. Also x = 0 is the only solution of the 
matrix equation ax — 0, i.e. of the three scalar equations 

x 1 + 2x a +3x 3 = 0, 

-Xj + 2x s = 0, 

x a = 0. 

(ii) Simply Degenerate, Matrix. The matrix 



is of degeneracy 1 (or rank 3 — 1 = 2), since at least one first minor of 
| a | does not vanish whereas \a \ = 0. Hence a single linear relation 
connects the rows (or the columns). In fact 

[1, 2, 9] — 2[2, 0 ,2] + [3, — 2, — 5] = 0, 
and {1, 2, 3} + 4{2,0,-2}-{9, 2,-5} = 0. 

From the second of these relations we see that, in the present 
case, the equations ax = 0, or 

Xj -f- 2x a "I* 9x s = 0, 

2 xj *f 2x a — 0, 

3 Xj — 2x a — 5x a = 0, 

have the particular solution x = (1, 4, - l}=x, and the general 
solution x = cac, where c is an arbitrary constant. 


(iii) Multiply Degenerate Matrices. The matrix 


os 


'0 0 0 O ' 
0 0 0 0 
0 0 10 
.0 0 0 1 . 


is of degeneracy 2 and rank 2. The equations ax = 0 can here be satisfied 
by x(l) = {1,0,0,0} and as(2) = {0,1,0,0}, and the most general 
solution is x = c 1 ac( 1) -+- c s ac(2). 

As another illustration we may take 

'1 2 3 - 1 ' . 

2 4 0 -2 

-1 -2 -3 1 

. 0 0 0 0 . 


as 
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This matrix is of degeneracy 3 and rank 1. The equations ax = 0 are 
satisfied by the three columns 

05(1) = {2, — 1, 0,0}, 

*(2) = {3, 0,-1, 0}, 

*(3) = {1, 0, 0, 1}, 

and the most general solution is 

x = c 1 a?(l) + c 2 ap(2)4-c 3 ac(3). 

(iv) Singular Matrices Expressed as Products. A square matrix a 
which has proportional columns and rows is of rank 1, and conversely. 
Such a matrix has effectively one linearly independent column and 
one linearly independent row, and is expressible as a product of the 
type {6J For instance, the second matrix in example (iii) can be 
written as the product 

{1, 2, — 1, 0} [1, 2, 3,-1]. 

In the same way a square matrix a of rank 2 is expressible as a product 
of the type 6(3, in which b consists of two independent columns and 
(3 consists of two independent rows. The columns of a are all linear 
combinations of the two columns of 5, and the rows of a are all linear 
combinations of the rows of (3 . For instance, the first matrix in example 
(iii) can be written as 

ro oi ro o 1 oi, 

0 0 [o 0 0 lj 
1 0 
.0 1- 

4 

while similarly the matrix a of rank 2 in example (ii) can be represented 
in the following ways: 

T 2 91 = Tl 21 p 0 11 1[ 5 “ 1 1 1- 

2 0 2 2 0 [0 1 4j 2 2 (0 1 4J 

.3 -2 — 5j |_3 — 2j |3 — 7j 

More generally a square matrix a of order n and rank r can be repre¬ 
sented by 

* a = bp, 

in which b is an (n, r) matrix with linearly distinct columns and (3 is 
an (f, n) matrix with linearly distinct rows. 
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1*10. Adjoint Matrices. If [a^] is a square matrix and is the 
cofaetor of a {j in | a |, the matrix [ A yj is said to be the adjoint of [i a 4i \. 
It should be noted carefully that the adjoint of a matrix a is the 
transposed of the matrix of the cofactors of a. 

Since by the properties of determinants 


n 



S a ir A ir = 0 

1 

if i + j, >' ■ 


and 

n 

2 «ir^ir = | « 
r=l 

1. ' 


it follows that 

[^] | 

a 1 1. 

. (i) 

Similarly, 

= 1 

| a | /. 

.(2) 

From equations (1- 

8 -2) and (1) we see 

also that 



1 A n 1 = 1® 

|n—l 

.(3) 


The properties of the adjoint matrix expressed by equations (1) and 
(2) are of great importance. In the special case where a is singular, so 
that I a I =0, 


aA — Aa = 0. 


.( 4 ) 


Anticipating results given in § 112 regarding square matrices having 
a null product, we can deduce from (4) that the adjoint of a simply 
degenerate matrix a necessarily has unit rank. If a has multiple 
degeneracy, the adjoint is by definition null. 


Examples 

(i) Adjoint of Non-Singular Matrix . 


[ a ij] — 

r 2 0 71 

9 

; i A n] = 

• 3 7 — 28" 


-1 4 


17 -17 -17 


/ 3 1 2. 


.-13 -2 8 


The product of these two matrices in either order is 


‘-85 

0 

0 " 

= -85/ 3 = 

2 

0 

7 

0 

-85 

0 


-1 

4 

5 

0 

0 

-85 


3 

1 

2 


(ii) Adjoint of Simply Degenerate Matrix . 


II 

A 

’2 0 2' 

-14 3 

; [^] = 

’ 13 2-8' 

13 2-8 


.314. 


.-13 -2 8 


In this case both products aA and Aa vanish, and A has unit rank. 
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(iii) Adjoint of Multiply Degenerate Matrix. 

* o o o o j [^ 1^3 * o* 

0 0 0 0 
0 0 10 
.0 0 0 2 . 

Note that here a is of rank 2 and degeneraoy 2, and that A is null. 

1*11. Reciprocal Matrices and Division. If a is a non-singular 
square matrix, the elements of the adjoint A may be divided by | o |. 
The matrix so obtained is called the reciprocal or inverse of a, and is 
written ar 1 . By (M0-1) and (1-10-2) it follows that 

aa~ l = a _1 a = I. .(1) 

It is easy to see that if a is given, the reciprocal of a is the only square 
matrix x which satisfies the equation ax = I. Similarly, a -1 is the only 
matrix which satisfies xa = I. 

If a square matrix is not singular it possesses a reciprocal, and 
multiplication by this reciprocal is in many ways analogous to division 
in ordinary algebra. In conformity with the terms adopted for matrix 
multiplication, we may refer to b~ x a (when it exists) as a predivided 
by b, and to ab~ x as a postdivided by b. 

Methods for the computation of reciprocal matrices will be described 
in Chapter iv. 

Examples 

(i) Reciprocal Matrices. 

o = 1*3 2 0 -11; a 1 = T 1 1 -2 -4l . 

0221 010-1 
1 _2 -3 -2 -1-13 6 

.0 1 2 lJ »2 1-6-10. 

The product of these two matrices can be verified to be J 4 . 

(ii) Predivision and Postdivision. 

*"■[_?-g- 

Predivision of a by 6 gives 

ruT 

Postdivision of a by 6 gives 

i-i auTU~.ii J 
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1*12. Square Matrices with Null Product. In scalar algebra 
the equation ab = 0 can only be satisfied when at least one of the 
factors a and b vanishes. However, if a and b are matrices, their pro¬ 
duct can be null even though neither factor is null. If one factor is 
null, then obviously the other is arbitrary; but if neither factor is null, 
then both factors must be singular. 

Let ab = 0, where the matrices are square and of order n. Since all 
the elements in the pth column of the product vanish, it follows that 

a U^ljp + a 12^2p + ••• + a ln^np ~ 
a 21^1p+ a 22^2p + ‘*-+ a 2n^nu = 


a nl&lp + a n2&2p+--*+ a rm&np = 

Firstly, suppose that | a | is not zero. Then the foregoing equations 
can only be satisfied if all the elements of the pth column of b vanish. 
Hence b is null, i.e. it is of rank 0. The rank of a is n , so that in this case 
the sum of the ranks of a and b is ft. Next suppose that | a | vanishes, 
while the cofactors of its elements are not all zero, i.e. let a be of rank 
ft — 1. The equations can now be satisfied by non-zero values for some, 
at least, of the elements b ip , and the ratios of these elements are 
uniquely determined. But the elements b ^ in the gth column of 6 must 
also satisfy the equations and must therefore be proportional to the 
corresponding elements of the pth column. Hence b is of rank 1. 
Evidently 6 can also be null, i.e. of rank 0; hence the sum of the ranks 
of a and of b is in this case either n or n — 1. 

More generally it can be shown that if the product of two square 
matrices is null the sum of their ranks cannot exceed their order. 
This is a particular case of Sylvester’s law of degeneracy,* which states 
that the degeneracy of the product of two matrices is at least as great 
' as the degeneracy of either factor, and at most as great as the sum of 
the degeneracies of the factors (see example (ii)). This law is illustrated 
in a simple way when both matrices concerned are diagonal, with 
certain diagonal elements zero. Suppose the degeneracies of a, b are 
p, q> respectively; this implies that p diagonal elements of a, and q of 
6 , are zero. Evidently if p ^ q and all the q diagonal ciphers of b are in 
positions corresponding to ciphers in the diagonal of a, then the product 
will contain only p diagonal ciphers, i.e. the degeneracy of the product 
is p. On the other hand, if none of the q ciphers in 6 is in a position 

\/* Often called Sylvester’s law of nullity. A short proof is given on p. 78 of Ref. 1. 
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corresponding to a cipher in a (which will be possible provided p + q>n), 
then the product will contain p + q ciphers; i.e. the degeneracy will be 
P+9- 

Examples 

(i) Square Matrices with Null Product . Let 

a=n 0 0 0 ], b = ro 0 0 01 . 

0 1 0 0 0 0 0 0 

0 0 0 0 0 0 1 0 

-0 0 0 OJ Lo 0 0 1 . 

Here ab = 0, both matrices are of rank 2, and 2 + 2 > 4. Note that if 
the rank of either matrix is increased by the addition of a further unit 
in the principal diagonal, the product cannot vanish. 

Again, if a = p 1 1*1, b = T 3 4 21, 

2 2 2 -2-1-1 

5 5 5J [-1-3-1. 

then ab = 0. Here a is of rank 1, b is of rank 2, and 1 + 2 > 3. 

(ii) Sylvester's Law of Degeneracy. The validity of this law is 
illustrated by the following considerations. Let a, b have degeneracies 
p, q, respectively. Then it is possible to represent a by a product Aoc, 
where A , a are ( n,n—p ), (n—p,n) matrices, respectively (see §1*9, 
example (iv)). Similarly, b can be expressed as Bfi, where B, fi are 
(n, n — q ), (n — q , n) matrices, respectively. Moreover, the columns in A 
and in B, and the rows in a and in /?, severally, will be linearly indepen¬ 
dent. Hence we may write 

ab = ActBp — Ay , 

where y — aB/3. 

For definiteness suppose p > q. Then since A has only n—p linearly 
independent columns, and since the rows of y may not be linearly 
independent, the product hats at most n—p linearly independent rows 
and columns, and its degeneracy is thus at least p. Similarly, if q>p, 
the degeneracy is at least q. 

Next consider the product in the form aBfi. Since a has degeneracy 
p, there are p relations connecting the columns of a; i.e. there are p 
columns x such that ax = 0 (see § 1-9). In the extreme case, therefore, 
it is possible for p of the n — q columns of B to give null columns when 
premultiplied by a, and the remaining n—p — q columns will in fact 
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be linearly independent. Hence aB has n—p—q linearly independent 
columns, and the degeneracy of the product ah in this extreme case is 
therefore p + q. 

1-13. Reversal of Order in Products when Matrices are 
Transposed or Reciprocated, (a) Transposition . Let A be a (p, n) 
matrix and B an (m,p) matrix. Then the product P = BA is an (m, n) 
matrix of which the typical element is 

Pij = 2 BirA r j- 
r=l 

When transposed, A and B become, respectively, (n,p) and (p,m) 
matrices; they are now conformable when multiplied in the order A'B'. 
This product is an (n, m) matrix, which is readily seen to be the trans¬ 
posed of P, since the typical element is 

i A H B jr = P H . 

r=l 

Hence when a matrix product is transposed, the order of the 
matrices forming the product must be reversed. Similarly, if 

CBA = CP — R } then R' = P'C' = A'B'C'. 

It is evident that the reversal rule holds for any number of factors. 

( b ) Reciprocation . Suppose that in the equation P = BA the 
matrices are square and non-singular. Premultiply both sides of the 
equation by A-^B- 1 and postmultiply by P _1 ; then A~ X B-^ = P -1 . 
Similarly, if R = CBA , then R 1 = -4~ 1 P _1 C r ~ 1 . The reversal rule 
again applies for any number of factors. 

Examples 

(i) Transposition of Product of Square Matrix and Column, liax = 6, 
then x'a r = 6\ For example, if 

"7 4 21 rx{\ — pil, 

5 3 1 x 2 14 

3 2 lj Uj L10_ 

then [a?! x 2 x z ] F7 5 3“ = [21 14 10]. 

4 3 2 

2 1 1 . 

Both of these matrix equations represent the same set of three scalar 
equations. 



26 SYMMETRICAL MATRICES 1*13-1*14 

(ii) If A, Bare Symmetrical, then the Continued Product ABA... BA 
is Symmetrical. For if ABA... BA = Q, then 

O' = A' B'... A' B*A' = AB... ABA = Q. 

For example 

p lip 21 p 11 = p2 51 p 11 = p9 171. 

U lj [2 lj|l IJ L? 3JU lJ U7 loj 

(iii) The Reciprocal of a Symmetrical Matrix is also Symmetrical. If 
a is the reciprocal of a symmetrical matrix A, then Aa — I. By the 
reversal role a'A' = a'A = I' = I, and postmultiplication by a yields 
a = a': hence a is symmetrical. For example, 

4 3 21~ 1 =r-l 1 11. 

3 2 1 10-2 

2 1 lj 1 -2 1. 

(iv) The Reciprocal of a Skew Symmetric Matrix of Even Order is 
also Skew Symmetric. If A is a skew symmetric matrix of order n, 
then A = — A'. Hence | A | = (— l) n | A' | = (— l) m | A |. When » is 
odd, then | A | = 0, so that a skew symmetric matrix of odd order has 
no reciprocal. When A is of even order, let Aa = I. By the reversal 
rule a'A' — — a'A — I' — I. Postmultiply by a: then a' = — a, so 
that a is also skew symmetric. For example, 

[-: m- 

(v) The Product of any Matrix and its Transposed is Symmetrical. 
If P = u'u, then P' — u’u — P. For instance, 



An obvious extension is that if a is a symmetrical matrix, then the 
product u'au is symmetrical. 

1*14. Linear Substitutions. Suppose that 

Vi = u 11 x 1 +u la z a +... + u ln z n , 
y a = u ai z l +u aa x a + ...+u an z n , 


y n = «»x*x+«»a* S + ••• +«»**»• 

Then the set of variables y is said to be derived from the set a: by a 
linear transformation, and the equations are also said to define a linear 
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substitution. The whole set of equations can be represented by the 

single matrix equation „ ... 

y = wc, .(I) 

where * and y are column matrices. The square matrix u is called the 
matrix of the transformation. 

Now suppose that a third set of variables z is derived from the set y 
by the transformation whose matrix is v. Then z = vy = vux. Thus the 
transformation of x into y and the subsequent transformation of y 
into z is equivalent to a certain direct transformation of x into z. This 
is called the product of the transformations, and its matrix is the 
product of the matrices of the successive transformations taken in the 
proper order. 

When u is non-singular, equation ( 1 ) may be multiplied by u _1 to 
8 ™* x = u-y .( 2 ) 

The matrix of the inverse transformation is therefore the reciprocal 
of the matrix of the transformation. If the quantities x { in ( 1 ) are 
regarded as unknowns, then their values are given explicitly by ( 2 ). 
Hence the solution of a set of linear equations may be found by the 
calculation of the reciprocal of the matrix of the coefficients, provided 
this matrix is not singular. 

Examples 

(i) Linear Substitutions. If 

Vi = 2x 1 -2x t + 3x s , 
y t = * x + *,+ *„ 
y s = *1 + 3*,- *„ 
and Zi = Vi + 2y„ 

= - y%+ y t , 

z, = 2y 1 + 3y, + 4y„ 

then z = Tl 0 2] T2 -2 3] * = T 4 4 11*. 

0-11111 02-2 
.2 3 4j Ll 3 -IJ Lll 11 6 . 

(ii) Solution of Linear Algebraic Equations. If 

3*i—2*, — *, 7, 

2*,+2*3 + *, — 5, 

*i-2*,-3* a - 2*4 = -1, 

*, + 2*3+ * 4 = 6, 
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then 


Hence (see example (i) of § 1*11) 

1 1 -2 


■3 -2 0 -r 


X 1 

= 

- r 

0221 


X 2 


5 

1 -2 -3 -2 


*3 


-1 

-0 12 1- 


-X 4 - 


. 6. 


0 
-1 

La; 4 J L 2 


0 

3 


-4' 

-1 

6 


1 -6 -10J 


"■j 

■ r 

= 

■-10" 


5 


-1 


-1 


21 

J 

- 6- 


.-35- 


or 


*i - 10 , x 2 — — 1, £3 — 21 , 


-35. 


1*14—1*15 


1*15. Bilinear and Quadratic Forms. If x, y are two sets of n 
variables, a function A(y,x) which is linear and homogeneous in the 
variables of each set separately is called a bilinear form. Evidently 
such a function has n 2 coefficients and can be written 

n 

A{y,x) = S y r (a rl x 1 +a n x z + ...+a rn x n ) 

1 


= \yi y 2 ••• y n ] 

a ll a 12 ••• a ln 


- 1 

i 5 


a 21 a 22 • • • a 2n 


*2 


a nl a n2 ••• a nn J 


-xj 


A bilinear form can therefore be expressed concisely as 

A(y,*) = v ax = x'a'y', 

in which y is a row matrix and a; is a column matrix. 

When the sets of variables are identical, so that y = x\ the bilinear 
form becomes a quadratic form , i.e. a homogeneous function of the 
second degree. The coefficient of x i x i (i+j) in the quadratic form is 
a ij + a ji> an( l this will be unaltered if a {j and a j{ are both changed to 
\(a %j -f Hence a convenient expression for a quadratic form is 

A(x,x) = x'ax, 

where a is a symmetrical matrix. 

If we make the substitution x = hz , where h is a non-singular square 
matrix of constants, then 

A(x,x) = z'h'ahz = z'ftz, 

where b is the symmetrica] matrix h'ah. When two square matrices 
a and b (not necessarily symmetrical) are related by an equation 
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l of the type b = p'ap, where p is non-singular, they are said to be 
'connected by a congruent transformation . 

The representation of bilinear and quadratic forms* by matrices 
has two great advantages. It is concise, and it emphasises the fact that 1 

{ 'any special properties of the form correspond to special properties of 
the matrix of its coefficients. Examples of its use in relation to 
dynamics will be given in Chapter vrn. 


Examples 

(i) Bilinear Forms having Unequal Numbers of Variables . If there 
are n variables y and m variables x the appropriate bilinear form is 
A(y } x) = yax = x’a’y ', where a is an ( : h , m) matrix. For instance, 


A(y,x) = y 2 , y 3 ] 

1 , 2- 

[*T 

II 

F 

UJ 

CO 

0 

y{ 


3, 0 

U 2 . 

1 [2,0 ,-lJ 

Vi 


0,-1. 



Hz. 


= 2*2) + 3 y 2 *i- 2/3*2- 


(ii) Partitioning of Bilinear Forms. The usual rules for the par¬ 
titioning of matrices can be applied with bilinear forms. As an illustra¬ 
tion suppose 


= [Vv Vi. 2 / 3 ] 

a ll a l2 a l3 


r*i" 


®21 ®22 a 23 


x 2 


a 22 ^33. 


- X 3- 


If \y v y 2 ] = y and {x v x 2 } = x , we may write 


My,*) = \y>Vz\ 




"22 


"13 


X 


or say A(y, x) = [y, y 3 ] J a,y?j |* j 

= [*/. 2 / 3 ] Sax+fixA 
\yx + Sx 3 J 

= yax + yfix 3 +y 3 yx +y 3 Sx 3 . 

As a further example consider a quadratic form 

A{x,x) = x'ax. 

* A sim ple account of the properties of bilinear and quadratic forms is given in Chaps, 
vm to xi of Ref. 1. 
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MS-M6 


Then if we suppose * to be divided into m variables y and k variables 
y, so that x = {y, ij), we may write 

Since ois symmetrical, the submatrices a, 8 are square and symmetrical, 
and are of the orders m and k, respectively. Further, the rectangular 
submatrices ft, y are, respectively, of the types (m, k) and (k, m), while 
ft' = y. The expanded expression is 

4(*,*) = |y,9']ray+^'| 

\yy+8ri\ 

- y'aty + y’fty + rj'yy+rf8r\ 

= y'ay + r/'(f+y)y+y'8y 
= y’ay + 2ij'yy + 7j'8ri. 


1*16. Discriminants and One-Signed Quadratic Forms. If 

A(x, x)=x'axia &quadratic form of m variables x, then the determinant 
A*, b | a | is usually termed the discriminant of the form. The conditions 
that a given quadratic form A(x, x) shall be one-signed" 1 (say positive) 
for all real values of its variables are of great importance in dynamics, 
and they are connected with the signs of A m and of the discriminants 
of the forms derived from A(x,x) by omission of any number of the 
variables x. A brief discussion of this question follows. 

For simplicity assume, firstly, that none of the discriminants in 
j question is zero. This implies, in particular, that none of the principal 
I diagonal elements a ti of a vanishes. In this case all the terms involving 
the variable x 1 can be collected together as a perfect square, and the 
quadratic form can be rewritten as 

A(x,x) = aJx x + ( ^x i + ...+ < ^x n \ +£, 

where B is a quadratic form involving^all the variables with the 
exception of x v Treating B in a similar manner, and continuing the 
process, we finally express A(x, x) as the sum of m perfect squares. 

Thus, say, A{X)X) = E ^ {+Et i\+..,+E^i'Ei, ' .(1) 

where J? is a diagonal matrix the elements of which are all clearly 
rational functions of the elements of a. Moreover, the variables and 

* A quadratic form which is positive or zero for all real values of the variables is spoken 
of as a positive definite form. 
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x t will be connected by a linear transformation of the special “tri- 
angular” tvne 

14 , 14 . 

0 1 U 2Z — U 2m 

0 0 1 ... u 3m 

_0 0 0 ... 1 

and a will be derived from E by the congruent transformation a = u'Eu 
(see § 1 * 16 ). Since | u | = | v! | = 1 , it follows that 

E 1 E 2 ...E m = A^ 

Suppose next that x m — 0, so that also = 0. Then, as previously, 
we can prove that 

E x E 2 ...E m _ x = A m _ x , 


where A m _ x is the discriminant of the quadratic form obtained from 
A(x,x) when x m = 0. More generally, if x m = x m _ 1 = ... = x j+1 = 0, 
then 

E X E 2 ... Ej — Aj* 

Hence we find that 




From (1) it is clear that the necessary and sufficient conditions that 
A (x, x) shall be a positive function are that all the coefficients E shall 
be positive. Hence by (2) all the discriminants A m , A m _ 1 , ..., A x must 
be positive. We could, of course, have reduced A(x,x) to a sum of 
squares such as (1) by collecting together the variables in quite a 
different order. Each such method of reduction would lead to a different 


set of discriminants, but the final criteria are in fact all equivalent. 

The argument given above requires modification if any of the dis¬ 
criminants is zero. For example, if a n = 0 the first step in the reduction 
ofA(x, x) would fail. In this case, leaving x x and x r general and choosing 
the remaining variables to be zero, we have A(x , x) = a rr x^ + 2a lr x 1 x r . 
This expression will not be one-signed for all values of x x and x r unless 
a ^ = 0. Since this conclusion applies for all elements a lr , we see that 
if a u = 0 the form cannot be one-signed unless x x is completely absent 
from A (; x , x ). More generally, we may immediately rule out all quadratio 
forms having any zero principal diagonal elements. 

Suppose next that, while a xx + 0 , yet B contains some zero principal 
diagonal elements. By the same argument as before B (and therefore 
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A) will not necessarily be positive unless the variables in B corre¬ 
sponding to its zero principal diagonal elements are all absent. It is 
easy to show that in order that, say, x 3 may be completely absent from 
B the 5th row (or column) of elements in a must be proportional to the 
first row (or column). The procedure then is to retain the first row and 
first column of a and all other rows and columns not proportional to 
the first. The discriminants actually to be used are then constructed as 
before from the matrix a as thus reduced. In these circumstances the 
total number of squares involved in (1) will be less than m. Illustrations 
of some of the possible cases which can arise are included in the 
examples which follow. 

Examples 

(i) Discriminants all Positive . 


A(x,x) = [x v x 2 , x 3 , a; 4 ] 

■ 1-1 2 r 


X 1 


-1 3 0-3 


*2 


2 0 9 -6 


x 3 


. 1-3-6 19. 


1— 

L_ 


Here we may take = 1 and- 


1 -1 

— 2; A 3 — 

1 - 

-1 

2 

-1 3 


-1 

3 

0 



2 

0 

9 

A 4 = | 

1 -1 2 

H = 

24. 



-1 3 0-3 

2 0 9 -6 

1 -3 -6 19 

These particular discriminants are all positive, and A(x,x) is accord¬ 
ingly a positive function. It can be verified that A (x, x) is expres¬ 
sible as 

A{x , x ) = (a^ x 2 + x 4 )^ + 2(x 2 ~h #3 —'•£ 4 )^ + 3(#3 — 2 # 4 )^ 4tx%. 

(ii) Case of Proportional Rows and Columns . 


N. 

H 

II 

IT 

M 

J* 

& 

■1-1 2 r 


"*T 


-1 1 -2 -1 




2-2 7-4 


*3 


. 1 - 1-4 17 . 




Since the second row of a is proportional to the first, the variable x 2 
will be absent from B . Omitting the second row and column, and 
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distinguishing the discriminants for the modified matrix by clarendon 


type, we have 




A x = 1; A a = 1 2 =3; 

a 3 = 1 

2 

1 

2 7 

2 

7 

— 4 


1 

-4 

17 


The form in question is accordingly positive and expressible as the 
sum of three squares. It can be verified that 

A ( x , x) = (x x — x 2 + 2x 3 + x 4 ) 2 + 3(x 3 - 2x 4 ) 2 + 4a;f. 

(iii) If a — u'u, where u is Real, the Discriminants of x'ax are all 
Positive . Let y = ux, where x, y are real columns. Then 

x’ax = x’u'ux = y'y. 

The product y'y is a sum of squares and is therefore always positive. 
Hence x'ax is a positive form and the discriminants are all positive. 

More generally, if d is a diagonal matrix of positive quantities, the 
discriminants of x'u'dux are all positive. 

1*17. Special Types of Square Matrix. For convenience of refer¬ 
ence the formal definitions of one or two particularly important types 
of square matrix will now be given. Some of these types have already 
been referred to incidentally, and exemplified, in preceding pages. In 
the list which follows the matrix concerned is denoted simply as u, and 
as usual an accent means transposition. If the elements of u are complex 
numbers, the matrix with the corresponding conjugate complex 
elements—that is to say the conjugate * of u —is denoted as u. 
u — u' ... ... Symmetrical 

u = —u' . Skew Symmetric , or Alternate 

u~ x = u' ... ... Orthogonal 

u = u* ... ... Hermitian 

A few additional properties^ are given in the examples which follow. 


Examples 


(i) Symmetrical Matrices and Determinants . It is to be noted that the 
product of two symmetrical matrices is, in general, not symmetrical. 
For instance n _ r 14 17 , 

[2 3 JI 5 6J L23 28J 


* The term “conjugate” is used by some writers as meaning “transposed”, 
t For a more complete account see, for example, Chap, iv of Ref. 2. 
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If [o,^] is a symmetrical matrix of order n, the adjoint [A }i ] is also 

n n 

symmetrical. Since 2 = 0 and 2 a ir^ir ~ \ a \> we have by 

r-1 r-1 

direct multiplication 


' 1 

0 

0 

... 0 ' 

= 

An 

A 2 i 

A S i 

0 

1 

0 

... 0 


A 12 

a 22 

*^32 

<*31 

<*32 

<*33 

... a 3n 


0 

0 

|a| 

<*41 

<*42 

<*43 

... a 4n 





o»i 

<*»2 

<*»3 

••• ®nn. 


. 0 

0 

0 


Moreover, == | a j 71 " 1 . Hence 

Ml <*» 


<*33 

a 34 ••• 

<*8n 

<*43 

°44 ••• 

<*4» 

<*n3 

a n4 ... 

<*nn 




-^»2 

0 


( 1 ) 


It follows from this equation that if A 1X — 0 the two determinants 
on the left have opposite signs. Again, if both |a| and A n vanish, 
then also A 12 = 0: similarly, all other first minors of type A lr vanish. 
Accordingly, if a symmetrical determinant and its leading first minor 
/both vanish, then the first minors of all elements in the first row (or 
| column) also vanish. An application of this property to dynamics is 
f given in § 9*9. 

(ii) Orthogonal Matrices . The matrix of the direction cosines of 
three mutually perpendicular axes OX v OX 2 , OX 3 referred to three 
fixed axes is orthogonal. For, let (Z n , Z 12 , Z 13 ), (Z 21 , h* hz)> (hv h* J 33 ) be 
the direction cosines of the three axes, respectively. Then 


i = 

'*11 

hi 

hi 

and l' — 

'hi 

hi 

^31 


^21 

hi 

^23 


hi 

hi 

hi 


.hi 

hi 

^33- 


.ha 

Z 23 

^33. 


Now by the properties of direction cosines 


Ql + ^12 + ^18 ““ ^llAl ^12^22 "b ^13^88 — 0 , etc., 

so that U f = /. Hence l" 1 = V, and l is therefore orthogonal. The 
application of orthogonal matrices to the subject of kinematics will 
be considered in Chapter vm. 
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(in) Matrices Corresponding to Complex Scalars and Quaternions. If 
I denotes the second order unit matrix, and if/is the orthogonal matrix 

'■[.! i]. <2) 

then evidently I % = I , P — — I, II = II ■* /. Moreover, if 

al+bl — a'l+b'I, 

where a, a', b, b' are scalar multipliers, then a a a' and b = b'. The 
algebra of matrices of the type al + bl, where a and b are scalars, is thus 
identical with that of the scalar complexes a+ib. The complex quan¬ 
tity o+*6 is in fact here replaced by the real matrix T a 61. It will 

1-6 a\ 

be seen later that the “latent roots ” (see § 3-6) of this matrix are a±ib. 

Next suppose I to denote the fourth order unit matrix, and in 
addition introduce the orthogonal matrices 

J = roio oi; J = r o o l oi; K-r o o or. 

-1 000 0001 00 -1 0 
000 -1 -1 000 0100 
. 0 0 1 oj 0 -1 0 oj L-l 0 0 0. 

.( 3 ) 

Then P = I, P = J 2 = IP = IJK = -I, \ 

II = 11 = I, IJ = JI = J, IK — KI = K.j . (4) 

These relations provide a basis for the algebra of matrices of the type 
wl + xl+yJ+zK, in which w, x , y , z are scalars. 

The calculus of quaternions* is concerned with expressions of the 
type 1 w 4- ix +jy + kz f in which 1 is the scalar unit and i, j, k are three 
mutually perpendicular unit vectors. The laws of combination postu¬ 
lated for 1, i y jy k are precisely those given by (4), so that the two 
algebras considered formally correspond. A quaternion is thus com¬ 
pletely represented by the real matrix 


►Q 

II 

”1 

§ 

X 

y 

z " 

— X 

w 

—z 

y 

-y 

z 

w 

— X 

- — z 

-y 

X 

w. 


The “latent roots ” of q are w±i *Jx 2 -f-y 2 -f z 2 , repeated. If 
w* + x % + y 2 + z 2 s T 2 , then q'q = qq' — T 2 I. 

* See, for example, Ref. 3. 






36 QUATERNIONS 1*17 

In the algebra of quaternions q and q' would be described as “con¬ 
jugates”, while T would be called the “tensor” of q and q'. 

It may be noted that the four matrices of fourth order used in the 
preceding representation of a quaternion can, when partitioned, be 
expressed in terms of the second order unit matrix and the matrix / 
defined by (2). Hence a quaternion can also be represented as a linear 
combination of the four second order matrices 

G :]■ [: -•]■ ;]■ [• ;]■ 

where i = V— 1. 



CHAPTER II 


POWERS OP MATRICES, SERIES, AND 
INFINITESIMAL CALCULUS 


2*1. Introductory. In the present chapter we shall consider some 
of the properties of matrices which are expressible as functions of a 
given matrix or which have elements functionally dependent on real 
or complex parameters. In the latter connection the ideas of differen¬ 
tiation and integration of matrices will be developed. We shall also 
deal with certain types of infinite series of matrices (e.g. that 
defining the exponential function) which are of importance in the 
infinitesimal calculus. 

2*2. Powers of Matrices. If u is a square matrix of order n, then 

the continued product uuu ... u to m factors is written u m . In addition 

u° is interpreted to mean the unit matrix 7 n . By the associative law 

of multiplication, , • m 

r u l x u m = u m x u l = u m+/ , 

and (u m Y — (u l ) m — u™ 1 . 

Hence in the multiplication of powers of matrices the usual index 
laws of scalar algebra hold for positive integral and zero indices. 

The reciprocal matrix u~ x exists provided u is not singular, and the 
higher negative integral powers of u are then defined as the powers of 
the reciprocal. Thus = 

It readily follows that, provided u is not singular, the index laws are 
applicable for all integral indices. 


Examples 

(i) Integral Powers . If u = [3 —4], 


( 1 ) 


then 


u 2 = 13 - < 


[!::] 

and u « = (w 3 ) 2 = [7 -121 [7 -12] = T13 - 24]. 

[3 — 5 J [3 -sj L 6 -llj 


Hence also 
u 3 = uhi = uu 2 = 
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Again, the reciprocal of u is f — 1 4], so that 


2-2 


r-1 41, 

L-1 3j 


u~ 


r-i 4 

ir-i 

4] [~ 1 

[-1 a 

ij[-i 

sj L — 1 

[-3 f 

nr- 1 

41 -r 

[-2 £ 

dl-i 

3J L 


3 


■3- 


It is easy to verify by induction, or otherwise, that when a is any 


! 


integer 


[(1 + 2*), -4* 1. 

I (1 —2s)J 


•( 2 ) 


(ii) Fractional Powers. If m is an integer and if v is any square 
matrix such that t> m * u, then we may write v = u Vm and refer to v as 
an mth root of 14 . The number of mth roots which a given square matrix 
possesses depends on the nature of the matrix, and in special cases 
there may be an infinite number corresponding to any assigned value 
of m. Two simple illustrations will make this clear. Firstly, suppose 
u to be the unit matrix 7 a , and assume a square root to be 


Then we require 


« = [a p\. 

[y S\ 


v* = [a *+fly fl{a + S)' 
ly(at + t) fly + P_ 


ca 

or fly — 1 —a* = l-£* with y(a + S) = 0, fl(a.+8) — 0. 

The possible square roots are readily seen to be ± 7 t and 

£4 

with a, fl, y related by the condition fly = 1 —a* but otherwise arbi¬ 
trary. Hence in this case there is a doubly infinite number of square 
roots. 

Next let u = 


B 4 


Then it can be verified by a similar method that the only possible 
distinct square roots of u are the two matrices 


fi -a- 


It may be noted that one of these square roots is given by the formula 
( 2 ) with 8 = $. 
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MATRIX POLYNOMIALS 


A fractional power of a matrix is commutative with any other 
power, and the usual index laws are applicable to non-singular matrices 
for all real indices. The subject of fractional powers will be discussed 
further in example (iv) of § 3*9. 

2*3. Polynomials of Matrices. The typical polynomial of a 
square matrix u of order n is 

P(u) = p 0 u m +... +p m . 1 u+p m I n .(1) 

where m is a positive integer (the degree of the polynomial) and the 


coefficients p are scalar constants. It is to be noted that the final 
coefficient p m is multiplied by the unit matrix. The corresponding 
polynomial of a scalar variable x may be denoted by 

P(x) = p 0 x m +p 1 x m ~ 1 + ... +p m -iX+Pm .( 2 ) 

Evidently an identity such as 

P 1 (x)P 2 (x)mP 3 (x) 9 .(3) 

involving scalar polynomials, necessarily implies a corresponding 
identity p t(u) p Ma p M , .(4) 


for all the coefficients of the powers of a; in the expanded form of (3) 
can be identified term by term with the coefficients of the corre¬ 
sponding powers of u in (4), on account of the properties given in § 2*2. 
As a corollary, the multiplication of expressions containing only a 
single matrix is commutative. It therefore appears that the algebras^ 
of scalar polynomials of a single variable and of polynomials of a si ngle 
square matrix are completely analogous. 

Examples 

(i) Factorised Form of a Matrix Polynomial. If a v a 2 ,..., a m are the 
roots of P(x) = 0, then P(x)=zp 0 (x — a x ) (x-oi 2 ) ... ( x — oc m ). The corre¬ 
sponding matrix polynomial may accordingly be written in the 
factorised form P(u) =p 0 (u — a x I) (u — a 2 1 )... (u — a m /). 

For instance, let u = |"3, — 41 and suppose 

U-iJ 

P(u) » 3ti a —9^ + 6/ 
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In this case P(x) = 3x 2 —9a: + 6 = 3(x — 1) (x—2). Hence also 
P(u) = 3 (u — I) ( u—2I) 

= 3 p, -4] ri, -4] - 31" —2,41 = r-6,121. 
Ll,-2j[l,-3j L-1.2J L-S, 6J 


(ii) Theorem on Factorisation of P(A) I — P(u). If P(x) is the poly¬ 
nomial (2) and if A is any constant, then 

m 

P(A) — P(x) = 2 P r (X m ~ r -X m - r ). 

r=0 

The expression on the right is exactly divisible by (A — #), so that 
P(A)-P(*)«(A-*)S(A,*), 

where S is of degree m— 1 in both x and A. It follows that 
P(A) I - P(u) = P(A7) - P(u) = (A I - u) S(XI, u). 

Hence XI — u is a factor of P(A) I — P(u). 

(hi 


w ange’s Interpolation Formula . If a v a 2 , ...,a n are distinct 
but otherwise arbitrary constants, then, provided that the degree of 
P(x) does not exceed n-~ 1, 

II (<*,-*) 


vfi+r 


r-1 11 (<V 

«4=r 


«r)' 


This scalar identity is usually known as the Lagrange interpolation 
formula. It gives rise to the corresponding matrix identity 

„ n ( aj-u ) 


.(5) 


which holds good for all distinct values of the corstants a r provided 
that the degree of P(u) does not exceed n— 1. 


2*4. Infinite Series of Matrices. Let there be a equence of 
matrices u 0 , u v u 2 ,ad inf., which are all of the same oraer, and let 

p 00 

8 P = 2 u r . Then the infinite series 2 u r is defined to be convergent 

r* 0 r= 0 

if every element in the matrix S p converges to a not infinite limit as 
p tends to infinity. 

An important case is that in which the series is of the type 

8 = 2 ^ r u r , 

r=0 


(1) 
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SERIES OF MATRICES 


41 


where u is a given square matrix of order n, and the coefficients a are 
all scalar. Methods for the discussion of the convergence of such power 
series and for their summation will be given in §3*9. Sufficient, but 
not necessary, conditions for the convergence can readily be deduced 
from elementary considerations as follows. Suppose U to denote the 
greatest modulus of any of the elements of u. Then clearly the modulus 
of an element of u 2 cannot exceed nU 2 . Similarly, n 2 U z is an upper 
bound for the modulus of any element of u z , and generally 7i r ~ 1 ?7 r is 
an upper bound for the modulus of every element of u r . Hence the 
series whose terms are the moduli of any homologous elements in the 
successive terms of the power series (1) is not greater than 

cr = A q + A x U + nA 2 U 2 + n 2 AzU z +... 

= (^-)^ 0 + i (^0 + AJ + AJ *+ ...), .( 2 ) 

where 6 = nU, and A r is the modulus of a r . Accordingly, the matrix 
series (1) will certainly converge if the corresponding scalar series (2) 
converges. 

Example 


Suppose u = [* 0*15, -0*011, 

[ — 0*25, 0-15J 

and S = I + u + u 2 + u z +.... 

Here n — 2 and U — 0*25, so that 


2o* = l + l + (0*5) + (0*5) 2 +... 

= l-Hl-0‘5)- 1 = 3. 

Hence no element of 8 can exceed 1*5. The exact sum is evidently 
given by S(I — u) = /, or 




'0*85 

0*01' 

-i=i- 

0*85 

-oor 

0*25 

0*85 

0-72 

-0*25 

0*85 


The greatest element in this is 85/72 (< 1*5). 


2*5. The Exponential Function. The exponential function of a 
square matrix is defined by the same power series as the exponential 
function of a scalar. Thus 


. r u 2 u z 

exp u = / + u + 


(1) 


exp (— u) = ~I — u + 


u 2 u z 


and 
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On application of (2-4*2) it is seen that a dominant series for each of 
the series which constitute the elements of exp u is 


»—1 1 , rrv 
<r =-+ -exp(nc7), 


n 


n 


where U is the greatest modulus in u. The exponential series thus 
converges for all square matrices u. 

The scalar exponential function has the property 

e*+* = e*^, 

which is proved by direct multiplication of the two series on the right 
and identification of the product with the series on the left. Clearly, 
the result of the multiplication of the matrix exponentials exp u and 
exp v will correspond to that given by the multiplication of the scalar 
functions provided that u and v are permutable in multiplication. 
Hence 


e u e v = e v e u = 


-( 2 ) 


whenever u and v commute. In particular, this condition will be 
satisfied if u and v are polynomials of a given matrix. It should be 
noted, however, that (2) will not be valid if u and v are general matrices 
of the same order. 

From (2) it follows that 

e w e-* = e~ u e u = I. 

Hence exp u and exp (— u) are reciprocal. 


Example 


Remainder after 8 Terms . An expression for the remainder after 8 
terms of the exponential series can be derived from the known scalar 
identity 2 ^ r i 

« , - I +*+g+-+g 3 + E iJ 0 (i-')- I «”*- 

The corresponding identity for a matrix of order n is 


u 2 




w 


8-1 


«-l 


+ R 8 > 


where* 




If U is the greatest modulus of any of the elements of u, and if [1] 
denotes the square matrix of order n having all its elements unity, then 
* See § 2*10 for definition of the integral of a matrix. 
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clearly < xp (nU) [I] > 6xp (uz) for all values of z within the range of 
integration z = 0 to 1. Further, ft® -1 ?7® is an upper bound for any 
element of u 8 . Hence an upper bound for the value of R g is 

f \l-zY-'dz = <£2e"V[l]. 


v 2*6. Differentiation of Matrices. The elements of a matrix are 
often functions of a variable, say t. When it is necessary to exhibit the 
functional dependence of u on t, the matrix is written u(t) and the 
typical element is u {j (t). However, the simpler notation u and is 
usually sufficient, since the functional nature of the matrix will be 
clear from the context. 

If t receives the increment St, there will be corresponding increments 
of the elements of u , and the matrix of the increments may be written 
Su. The differential coefficient (or derivative) of u with respect to t is 
then defined by the equation 

du __ Su 
dt~ 

Hence the elements of dujdt are the differential coefficients of the 
corresponding elements of u. An immediate deduction is that 


d / 

dt {Ui+u * ] 


du-y du 2 

dt dt 


If w = UyU 2 , then 

w -f Sw = (Uy -f Suy) (u 2 + Su 2 ) 

= UyU 2 + (SUy)U 2 + Uy( Su^j + (SUy)( SU 2 ) , 

and on neglect of the second order term, this gives 


d , v (du 2 \ IduA 

i, wl + 


More generally the derivative of a continued product is formed as for 
scalar expressions, except that the original order of the factors must be 
preserved throughout . For instance, 

d q du 9 , du ~du 

and there is a similar expression for the derivative of any positive 
integral power of a matrix. Again u~ l u = J, and it readily follows that 

du -1 . du , 

~W~ u dt u * 



NOTATION FOR DERIVATIVES 


As for scalar functions, the symbol D will sometimes be used as an 

abbreviation for a differential operator such as Thus ^ == Du , and 

at dt 

d8 U 

more generally -j— = D 8 u. 

d * ( 8 ) 

The 5th derivative will also often be denoted by u, and in particular 
(i) 

Du = u. The accent notation for derivatives will be avoided, as it might 
be confused with the operation of transposition. 

In the multiplication of matrix products which involve the differ¬ 
ential operator D, care should always be taken to make sure which of 
the matrix products concerned are subject to the differentiation. For 
instance, both sides of the matrix equation Du = v can be premultiplied 
by a conformable matrix w to give wDu = wv. On the other hand, if 
both sides are postmultiplied by w, the result should be written 
(Du) w — vw and not Duw = vw. 


Examples 

(i) Differentiation of a Product . Let 


h — [cos a, sinal 
[cost, sintj 

h - r o, oi 

[ — sin t , cos t\ 


and u 2 = 


(i) 

and u 9 


[ cos t, cosal. 
sin£, sinaj 

= [ — sin t, 01. 
L cos^, Oj 


Hence, if w = u x u 2i we have 


(i) (i) (i) 

w= u x u 2 + u x u 2 


= [cosa, sinal [ — sin£, 01 + [ 0, 0 1 [cos^, cosal 

[cos t , sin t J L cos t, 0J [ — sin t f cos t\ [sin t , sin aj 

= [sin(a — t), 01 + [0, 0 1 = / sin (a — t). 

L 0, Oj [0, sin(a — <)J 

This may be verified by differentiation of 

w = [cosa, sinal [cost, cosal = [cos(a — t), 1 1. 

[cos t, sin tj [sin t, sin aj [ 1, cos (a — t)J 

(ii) Taylor'8 Theorem for Matrices . If u and v are permutable square 
matrices, the algebra of polynomials involving no other matrices than 
these (and the unit matrix) is formally identical with scalar algebra. 
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taylor’s theorem 


In this case if P(u) is a polynomial of degree m, 

(1) v 2 (2) v m (nt) 

P(u + v) = P(u) + vP(u) + nr P(u) + -.. + r~ P(u), 

I? I™ 

«> 

where P(^) is the usual ith derived function of P(^), namely, the 
matrix polynomial obtained by substitution of u for x in the scalar 
d l 

polynomial P(x). In particular, if h is a scalar 

(i) f> 2 < 2) h m < m > 

P(u + hi) = P(u) + AP(tt) + ^ P(w) +... + ^ P(u). 

(iii) Differentiation of a Function of a Matrix with respect to the 
Matrix . The differentiation of a matrix function f(u) with respect to u, 
which has already been illustrated in example (ii), can be defined more 

li-> 0 L h J 

where v is permutable with u and h is a scalar. If Taylor’s theorem can 
be applied, then 

lim 

h-> 0 L * J 


lim i 

h->0 L 


( 1 ) hv 2 ( 2 > 1 

V f( u ) + T2f( u ) + — J ^ 


(i) 

=/(«*)• 


2-7. Differentiation of the Exponential Function. If t is a 

scalar variable (real or complex) and u is a matrix of constants, the 
function exp ( ut ), defined similarly to (2-5-1), is 


exp (ut) = e ut = / + ut + - 


An application of the formula (2-4-2) shows that the foregoing series 
is absolutely and uniformly convergent for all values of t. Now the 
series obtained by differentiation of (1) term by term with respect to t is 


u 4* u 2 t + - 


- + = ue u 


Since this series is absolutely and uniformly convergent, it repre¬ 
sents the differential coefficient of exp (ut). Hence 


(&*) = ueUi — eUtu » 
at 


(2) 
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2 * 7 - 2*8 


More generally D m e ut — « m e“ < = e“V n , bo that if P(2>) is any polynomial 
of the differential operator D, 

P(D) e* = P(«) e* = e“<P(it). 

In the calculus of scalar functions 


P(D) (e«X) = e*P(Z>+ a) X, 

where P(D) is any polynomial of D, and X is any function of t. It will 
be useful to consider whether a similar rule is applicable for matrices. 
Suppose v(t) to be a matrix, not necessarily permutable with u. Then 

Z>(e“ < i>) = e vt Dv+e vl uv = e^ID+u) v. 

Again, Z>*(e“‘«>) = Pc w (/Z>+ u) v = e^ID + u) (ID + u) v. 

The result may be written 

D % (e' u v) = e^ID + «)*». 

Similarly, D m (& u v) = e' u (ID+u) m v, 

and more generally P(D) (e^v) — c“ < P(/Z)+ u) v. 

This result is valid when u is a matrix of constants, and v(t) is a matrix 
not necessarily permutable with u. When the order of the matrices is 
reversed to give P(D) (ve“ / ) there is no corresponding simple formula. 


2*8. Matrices of Differential Operators. The use of matrices of 
differential operators iB often a valuable aid to conciseness. As examples 
of branches of mathematics which naturally invite treatment by 
operational matrices we may refer to the theory of linear differential 
equations (see Chapters v, vr, vn) and to analytical dynamics (see 
Chapters vm, ix). 

To show how such matrices arise, let us suppose that we have to 
deal with a pair of first-order linear ordinary differential equations 

such as , , 

dy, ay* 

«ii-jjr+"nl/i + fu- Vi> 

dy, dy* 

v 2i-jf+v»iyi+v„-j* + u„y % = y t , 

in which the coefficients v if , u tj are given constants and are 

functions of t. These equations can be expressed without ambiguity as 

r VuD+ttu, Ui.PpM = pM. 
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DIFFERENTIAL OPERATORS 
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The operational matrix on the left may be denoted simply by f(D), 
and the equation can be abbreviated to 

f ( D)y = y . 

Similarly, if the differential equations were of order higher than the 
first, the appropriate matrix f ( D ) would have for elements polynomials 
of the operator D . 

Matrices of partial differential operators often arise in connection 
with problems involving several independent variables. If the in¬ 
dependent variables are denoted by z v x 2 ,# n , the simplest opera¬ 
tional matrices of this type are the column 

[iHhh .il- 

“ d ““ row IAK4i.£!• 

Examples 

(i) Matrix Differentiation of a Matrix Product. As an illustration of 
the matrix differentiation of a matrix product, suppose <j>(D) to be an 
operational square matrix of order 2, and let 

k == f k lx k 12 \ and M(t) = \e x ^ 0 ], 

U i k 2 J [0 e^J 

where A x , A 2 , and the elements k ij9 are constants. Then 

rf>(D) (kM) = T cj> xl (D) 0 i 2 (^)l r^u e Aa/ A: 12 l . 

e Aa ^ 22 J 

By the usual properties of differential operators this product reduces to 

T 01 l(^l) + ^12(^1) eAl ^ 21 > $^11(^2) eA *^12 ^12(^2) eA *^22 • 

*-^21(^1) ^ Al ^ll "h ^22(^1) eAl< ^ 21 > ^21(^2) gAt< ^12 ^22(^2) £ A *^ 22 - 

Hence 

< j >( D ) ( kM ) = r^nCAj) Jk 13L + ^12(^1) ^ 2 i> ^11(^2) ^12 "f ^12(^2) ^22! 

l 02 i(^-i) ^11 + ^22(^1) ^ 2 i> ^21(^2) ^12 "f ^22(^2) ^22-* 

A generalisation of this formula will be used in § 6*4 to obtain the 
solution of a system of linear differential equations with constant 
coefficients. 



48 


ROW AND COLUMN OPERATORS 


2 - 8 ~ 2-9 


(ii) Differentiation of Bilinear Forms. In applications to dynamics 
it is often necessary to differentiate bilinear or quadratic forms partially 
with respect to their variables. Let the bilinear form be (see § 1*15) 

A(y , x) = yax = x'a'y .(1) 

in which there are n variables y and m variables a?, and a is an (n, m) 
matrix. 

Firstly, differentiate A(y, x) with respect to one of the variables y, 
say y v Then 


— [1, 0, 0, 0] ax — # 12 ••• 


Hence 


Similarly, 


A = a# and 


A = a'y' and 


bs>-* v - 

{s,\ A - ya - 


Again, suppose A to be the quadratic form 

A(x,x) = x'ax, .(2) 

in which the square matrix a is now assumed symmetrical (see § 1*15). 
Then , ^ x ^ , 


|) 4 = 2 « and IJ- 

0Xi) L ox j-. 


A = 2 x'a. 


If the variables x, y and the coefficients a {j in (1) are all functions of 
a parameter t , and if the differentiation is with respect to t, then 

dA dy da dx 

*-*“ +i 'S a ’ +!w a' 

while if A is the quadratic form (2) 

dA dx’ ,da , dx n dx' ,da 
— = —~— ax + x-^-x + x a — = 2 ax 4 -x — x. 
x dt dt dt dt dt dt 

^ 2-9.* Change of the Independent Variables. The usual 
/ formulae of the differential calculus for the change from a set of n 
independent variables x to another set y are 

JL = y fyjjL 

dx { dx t dy } * 

in which i — 1,2, n. These relations can be expressed conveniently as 


(U—1£J- 
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in which a! is the transposed of the matrix 
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tyl 

fyi 

9^i 

dx x 

9*2 " 

' 9x„ 

3y 2 

9^2 

9y 2 

dx x 

9*2 

9*71 

tyn 

Hn 


dx± 

9*2 

9*71 


Similarly, 


where b is the matrix obtained when x and y are interchanged in a. 

From (1) and (2) it follows immediately that j^-J = a'b' |^-|, 
so that a'b' = I; hence ba = I — ab. These yield the familiar rela¬ 
tion |a||6 | = l between the Jacobians | a | es ^ v ^ 2) and 

I b I s d(*i,* 2 , it follows also that the adjoints A, B of a, b, 

respectively, have the properties 

A —b\a\\ B = a\bl, AB = I. .(3) 


A further property is given by a theorem due to Jacobi, which 
states in effect that 


■x— A = 0 and 5 = 0. 

\_ dx )J \JVjJ 


For instance, if n = 3, and if A 1V A lz , A 1S are the cofactors of the 


elements in the first row of a, then 




^ 12 ^ 13 } g ives 


JL l ty* d /% 2 dy 3 dy z dy 2 \ 3 tdy 2 dy 3 dy 3 dyA 

dx x \3# 2 dx 3 dx 2 3 xj dx 2 dx x dx 3 dxj + dx 3 \3x 1 dx 2 dx x dxj * 

and this vanishes identically. In the same way it can be shown that 

the other columns of A are annihilated by the operator ill- 

truth of the general theorem can be verified on similar lines. 

In the transformation of expressions which are quadratic in the 
differential operators, it is necessary to use in addition to the sub- 
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stitution (1) a substitution for the operational row bll Now a 
direct transposition of equation (1) leads to the substitution 


UJ" UJ°°’ 


.( 5 ) 


where the cipher suffix indicates the restriction that a is to be un¬ 
affected by the differentiations 3/3This restriction, which is very 
inconvenient, can be removed by the following elegant modification 
of the substitution due to Smith.* 

Let {B r J denote the rth column of the adjoint matrix B, and suppose 
<j> to be any scalar function of the variables. Then by (4) and (2) 

LiK )fS ■ B ''w l +B,, w t + '" +Br 'd^ 




Since bB = | 6 | J, and therefore also B'b' = | b | /, the last expression 

Pj JL 

reduces to | b | Accordingly by (3) 


so that 




Since <j> is arbitrary and | a 11 b | = 1 we obtain, as an equivalent to 
(5), Smith’s transformation 


L4l- | “ i lid l ‘ ,| " 1< ‘- 


.( 6 ) 


Examples 


(i) Linear Partial Differential Equation of the First Order. If the 
given differential equation is 


d<j> dd> 




d<f> . 

■ +u 'wr* 


* Ref. 4. 
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where u v u 2 , ...,u n and ^ are given functions of the n variables x, 
the matrix equivalent is ( g 


1 %I 


ill*-*' 


When transformed to new variables y the equation becomes by (1) 

(ii) Transformation of the General Laplacian Operator V 2 . The general 
Laplacian operator in n variables is defined to be 


™ 0 2 0 2 0 2 

V 2 =_i_-i_ 

dx\ dx\ dx 2 


l L^J \3«il 


After transformation to new variables y this operator becomes by (6) 
and (i) | a | O) 

vi -i“iyi«i—' y- 

In the important special case where y x — const., y 2 — const., etc. 
are orthogonal, the product aa r reduces to a diagonal matrix. 

As an illustration let us consider the transformation of the usual 

02 02 02 

three-dimensional operator = + to spherical polar 

cx £ dy* dz* 

coordinates r, #, The relations between the variables are here 

x = rsin#cos^; y = rsin#sin0; z — rcos#. 

Assuming x , y, z and r, 6, to correspond, respectively, to x v x 2 , x 3 
and y v y 2i y 3 , we have 


b = 


dx 

dr 

dy 

dr 

dz 

dr 


dx 

dd 

dy 

dd 

dz 

dd 


dx' 

00 

dy 

d<t> 

dz 

00. 


sin d cos 0 r cos d cos 0 
sin d sin 0 r cos d sin 0 
cos# — rsin# 


— rsin# sin0 
r sin# cos 0 
0 


This readily yields | 6 | = | a | _1 = r 2 sin#, and 


a = = 


so that 


1 


r 2 sin# 


r 2 sin 2 # cos 0 r 2 sin 2 # sin 0 r 2 sin # cos #' 

r 2 sin # cos # cos 0 r sin # cos # sin 0 —r sin 2 # 

— r sin0 rcos0 0 


aa = 


r 2 sin 2 # 


V 2 sin 2 # 
0 
0 


0 

sin 2 # 

0 
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The transformed operator is accordingly 


V 2 = 


i ra A A 

r 2 sin#L^r , 3#’ 30 J 


> 2 sin# 
0 
0 


0 

sin# 

0 


1 3/* d\ 1 3 / . . 3\ 

r 2 dr \ dr) + r 2 sin# 3# \ m 3#/ 


0 
0 

1 /sin# 

1 3 2 


lf_3 d 

\dr’ 3#’ 


_3 

30 


r 2 sin 2 # 30 2 * 


2-10. Integration of Matrices. When the elements of u are 
functions of a variable t, the integral of u with respect to t taken between 
the Hmits t 0 and t is defined to be the matrix which has for its (i, j)th 

element f u^dt. The integrated matrix is often written Q[ t u. When 
J t* 

no uncertainty can arise as to the range of integration, Q\ 9 may be 
abbreviated to Q . 

In defining the integrated matrix Qu we have tacitly assumed that 
the elements u are continuous functions of a real variable t. The 
definition applies equally to a complex variable t, but in this case the 
variable must be viewed as represented in an Argand diagram (£-plane), 
and the integrations of the elements must be performed along a suitable 
path or curve connecting t 0 and t in that diagram. The path of in¬ 
tegration can be chosen arbitrarily, apart from the restriction that it 
does not encounter certain £< barriers’' which can be constructed from 
a knowledge of the singularities of the elements. These barriers may 
be taken as the straight line continuations to infinity of the radii 
joining the point t 0 to the singularities.* 

An upper bound to the elements of an integrated matrix can readily 
be found as follows. Suppose U to be the greatest modulus of all the 
elements in u(t) y so that for all points t of the range (or path) of integra¬ 
tion, | | < U. Then, if t is a real variable, by a known property 

of integrals „ t t 

I 'M'ijty) ^ I | | dt ^ U(t t 0 ). 

J 1 0 J i 0 

Hence Q\u ^ U(t - 1 0 ) [1], 

where [1] is here used to denote the square matrix of order n having 
units for all its elements. More generally, if t is complex, let r be a 


♦ The singularities of any element are the points at which that element ceases to be an 
analytic function of t, i.e. at which the element fails to satisfy the conditions of being single- 
valued, continuous, and of having a unique derivative. The system of “barriers” referred 
to is spoken of as a “Mittag-Leffler star” (see for instance § 16*5 of Ref. 5). 
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current point on the path of integration, cr the arc of the curve up to 
the point t, and 8 the total arc from t Q to t. In this case 


so that 


u if (T)dr <| u^t) dT\^U^dcr^U8 9 

Q* f u^Us[ 1]. 


2*11. The Matrizant. A brief discussion will now be given of series 
of a special type which are built up by means of repeated integrations 
of a given matrix u(t). These series have their origin in the theory of 
systems of linear differential equations of the first order (see § 7*5), 
and to illustrate how they arise naturally, we shall begin by con¬ 
sidering the single scalar linear differential equation 

For simplicity assume the variables to be real, and suppose that a 
solution is required such that y = y 0 at t = t 0 . Direct integration with 
respect to t yields » t 

y{t) = 2/o+J ( f(Ti)y( T i)dT V .(i) 

in which t x is a subsidiary variable. For the particular value t = r l9 
this equation gives - 

y(r i) = y 0 +J t 7( t *) yfa) dT 2 > ( 2 ) 

where the subsidiary variable is now written r 2 . On substitution for 
y(7 X ) from (2) in (1) we obtain 

y{t) = y 0 + 2/o f f('r 1 )dr 1 + f /fo) f 'f(r 2 ) y(r 2 ) dr t dT v 
J to J to J tg 

and continued applications of (2) lead to the result 

y{t) = yMU), 

where 

^.(/) = l+ f f(Tl)dT 1 + f /(Tj) f f(T 2 )dT 2 dT 1 

Jt. Jt. Jt. 

+ f f( T i) f 1 fir2) f ‘f(T 3 )dT 3 dT 2 dT 1 + .... 

J to J to J to 

In the notation of § 2-10, this series would be written more concisely 

“ Qf.(/)*i +Q t t.f+Ql.fQ t J+-, 

or even more simply as 

£}(/) = 1 + Qf+ QfQf+QfQfQf +.... 
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A correct interpretation of this series is important. The meaning of 
the first integral Qf is obvious. To obtain the next term QfQf we first 
form the product fQf , and then, treating t in that product as a current 
variable, integrate between the limits t 0 and t. The succeeding terms 
QfQfQfy etc. are formed in a similar manner. The series formally 
satisfies the given differential equation and the assigned condition 
y = y 0 at t = t 0 . However, before it can be accepted as a genuine 
solution, its convergence must be established. The proof is very simple. 
It is assumed that, for all values of t concerned, a positive number A 
can be chosen such that | f(t) \^A. Then Qf^A(t — t 0 ), so that 

QfQf<A^ (t-t 0 )dt< A2(t ~ to) \ 

and so on. Hence 

Q(/) 1 +A(t-to )+ + ■. ■ 

This shows the series to be absolutely convergent. 

Suppose now that we had commenced with a system of n simul¬ 
taneous first-order linear differential equations instead of with a single 
equation. Such a system of equations can be written concisely as 

i-“('>»• 

Here y denotes the column of n dependent variables, and u is a square 
matrix of order n having for elements given functions of t. Direct 
integration gives rt 

V(t) = V(t 0 ) + u(r 1 )y(T 1 )dT v 

where y(t 0 ) is the column of values assigned at t = t 0 . A process of 
repeated substitution and integration, precisely analogous to that 
already described, leads to the formal solution 

y(t) = £l(u)y{t 0 ), 

in which Q(w) is the matrix series 

Q(u)==I n + Qu + QuQu+ QuQuQu+.... .(3) 

This expression is called* the matrizant of u. Care should be taken to 
interpret the meaning of the series correctly. The first term is the unit 


* The term is due to Baker; see p. 335 of Ref. 6. 
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matrix of order n , and the second term is the integral of u taken between 
the limits t Q and t. To obtain Qu Qu , we multiply u and Qu in the order 
uQu y treat t as current in the product matrix, and again integrate 
between the limits t 0 and t. The remaining terms are formed in succes¬ 
sion in the same way. 

The proof of the convergence of the matrizant follows the lines 
adopted for the scalar series. The variable t will be assumed to be 
complex, as no additional complication is thereby involved. The 
integrations are then supposed effected along paths in the Argand 
diagram (£-plane) which do not encounter any of the barriers referred 
to in § 2*10. In the region of the £-plane under consideration let U tj be 
an upper bound for the modulus of the typical element u^t), so that 
I u a I ^ farther, let U be a positive number such that ^ U for 
all the elements. Then, in the first place, 

Qu ^ J7*[l], 


where s is the arc of integration from t 0 to t. Similarly, 


Qu Qu 


< 


uQudt ^J £7[1] f7$[l]cfe, 


or 




In general, for m repeated integrations 

^ ^ ^ n m ~ l U m s m r „ 

QuQu... Qu <-:-[1]. 


Hence 


1 m 

r\/ \ r tt m nU 2 s 2 _ n 2 <7V ri _ 
fi(^) ^ /+ Us[l] H—— [1] H-— [1] +.... 


I! 


1 » 


Each of the series comprising the elements of Cl(u) is accordingly 
less than 

s _n-l+e nU * 


n 


It follows that the series in question are absolutely and uniformly 
convergent in the part of the Z-plane under consideration. 

By differentiation of the series (3) it follows that 


~Q(u) = uQ(u). 

Other properties of the matrizant are given in § 7-6. 


( 4 ) 
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Example 

The Matrizant of a Matrix of Constants. Suppose u to be a matrix of 
constants. Then 

Qu = uj dt — u(t-t 0 ); QuQu = u 2 j ( t-t 0 )dt = t etc. 

Hence in this case 

Cl(u) = I+u(t-1 0 ) + 


|2 


+etc. = 


Equation (2*7*2) is a particular case of (4). 



CHAPTER III 


LAMBDA-MATRICES AND CANONICAL FORMS 

3*1. Preliminary Remarks. Part I of the present Chapter gives 
an account of some of the properties of matrices which have for their 
elements polynomials of a parameter. The study of matrices of this 
type may be regarded as preparing the way for the applications to 
linear differential equations considered in Chapters v and vi. In Part II 
the important conception of the equivalence of matrices is explained, 
and a brief review is given of some of the canonical forms to which 
matrices can be reduced. 


Part I. Lambda-Matrices 

3*2. Lambda-Matrices. A square matrix* /(A)—or more briefly 
/—the elements f 4j ( A) of wliirfc are rational integral functions of a 
scalar parameter A, is referred to as a A -matrix. If N is the highest 
degree in A of any of the elements then / is said to be of degree N. 
Such a matrix can evidently be expanded in the form 

/ = A 0 A^-f -f... 4- Ajy^A-f A#, 

where A 0 , A v etc. are matrices independent of A. A A-matrix is accord- 
ingly a polynomial in A with matrix coefficients. The matrix A 0 will 
be spoken of as the leading matrix coefficient . 

The rank of a A-matrix is defined as follows: If at least one of the 
minor determinants of / of order r is not identically zero, whereas all 
the minors of order greater than r do vanish identically, then / is of 
rank r. 

It should be noted that in the foregoing definition the stated con¬ 
ditions must be satisfied for a general value of A. If particular values 
of A are assigned, a A-matrix having rank r in accordance with the 
definition may actually acquire a rank less than r in accordance with 
the definition of § 1*9. 


* Rectangular A-matrices will not he considered here. 
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Examples 

(i) Non-Singular A-Matrices. 

Order 3, degree 1: 

/ = [2A — 2, A + 2, -3] = [2,1, 0]A+[-2, 2,-3'. 

3A-1, A, -1 3, 1, 0 -1, 0,-1 

-1, 4A-3,-A+lJ L°> 4 .-lj L — —3, 

Order 2, degree 2: 

/ = [7A 2 - 34A +102, 3A 2 - 6A + 281. 

L 3A 2 — 18A + 50, 2A 2 -2A+15J 

(ii) Singular A-Matrices. 

Order 3, degree 3, rank 2: 

f = A, A, A 2 +1 
A 2 -1, A 2 +1, A 
A 3 , A 3 + 2A, 2A a +1 

Here | /1 =0, but the first minors do not all vanish identically. 

Order 3, degree 2, rank 1: 

/ = [A 2 , 2A 2 ,3A 2 ] . 

A, 2A, 3A 
.1,2, 3 . 

3*3. Multiplication and Division of Lambda-Matrices. Let 

/, g denote two A-matrices of equal order and of degrees N and M, 
respectively. If their typical matrix coefficients are A { , Bp respectively, 
the product gf formed by the usual rules is 

/ M \ / N \ M+N 

9f = S B } A M ~n S = S C k A*+»~\ 

\i-o / \i-o / k-o 

where C 0 = B 0 A 0 , 

Ci — B 0 A 1 -|- B 1 A 0 , 

and generally C k = 2 BiAs. 

i+f~k 

The product is therefore a A-matrix of degree N+M provided C 0 is 
not null. A similar conclusion is valid for the product fg. 

The reciprocal matrix / -1 —the elements of which are obtained by 
division of the appropriate cofactors in / by | /1—is not, in general, a 
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A-matrix. A special case of importance in applications to differential 
equations (see Chapter v) is where | /1 is independent of A and not zero. 
The reciprocal/ _1 then is a A-matrix. 

From the preceding remarks it is seen that normally a product such 
as g* 1 /, where both/and g are A-matrices, will not be a A-matrix. We 
shall now show that, if B 0 is not singular, it is possible to express the 
result of this predivision in the form 

g-'f^Q+g-'R, .(l) 

where the quotient Q and the remainder R are A-matrices, and the 
degree of R is lower than that of g. A convenient alternative to (1) is 

f^gQ+R- .( 2 ) 

To establish (1) we note firstly that if the degree M of g exceeds N, a 
solution of (2) which satisfies all the stated conditions is Q = 0 and 
R = /. Dismissing this trivial case we proceed on the assumption 
that M < N. The identity to be satisfied is then of the form 

A 0 \ n + A 1 A n ~ 1 +...+A n _ 1 A + A n 

= (£ 0 A *++ ... + B m ) (Q 0 \s-m +Qa n-m-i +... + q n _ m ) 

+ R 0 A M ~ 2 +... + 

Equating the coefficients of the successive powers of A we thus require 

~ BoQo> 

Ai = I?o0i + i*i0o, 


AN-M “ Bq Qn-M + *®1 Qn-m-i + • • • + Bn-M Qo> 
together with 

A n-m+1 = B 1 Qn-m + B z Qn-m-i + • • • + B N -m+i Qo + Ro> 
^n-m+2 — B 2 Qn-m + B 3 Qn-m-i + • • • + B N _ M+ 2 Q 0 + R v 

An = B m Qn-m + Rm-v 

.( 4 ) 

Provided B 0 is not singular, the matrix coefficients Q t of the quotient 
are uniquely determinable in succession by (3), and those Rj of the 
remainder are then given uniquely by (4). 

In a similar way it can be proved that, if B 0 is not singular, a unique 
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3*3-3*4 


quotient Q and a unique remainder R of degree less than that of g 
can be found such that 

The operations expressed by equations (2) and (6) are sometimes 
referred to as division on the left and division on the right, respectively.* 


f=Qg+R .(6) 


Example 

Division on the Left and on the Right. Suppose 

/= pA-2,A+21 and g = rA 2 +1, AT. 

1.3A-1, A J L 3, lJ 

The degrees of f and g are respectively 1 and 2. The identities corre¬ 
sponding to (2) and (5), with / placed on the left of the equation in 
each case, are both trivial, since M>N. On the other hand, if/and g 
are interchanged, the appropriate identities are respectively 

rA*+l,Al = r 2 A — 2 ,A + 2 ir—A + 7, - 11 + T59, -81, 

L 3, 1J I.3A-1, A JL3A-22, 3 J L10, OJ 
rA a +l,A] = r-A+10,A-71 r 2 A- 2 ,A + 2'|-t-ri4, - 201 . 

L 3, lJ L 0, 0 JL3A-1, A J L 3, 1 J 

3*4. Remainder Theorems for Lambda-Matrices. Let /be as 

defined in § 3*2, and suppose u to be a square matrix of the same order 
with elements independent of A. Write 

fi(u) = u N A 0 + u N ~ 1 A 1 + ...+A n , 

so that /(A) -Mu) = "x t(A/)*-* - u N ~<] A { . 

i-o 

Now 

(A/)*-* - u N ~i = (A/ - u) (IA 1 *-*- 1 + ttA^- 2 +... + ux-i- 1 ). 
Hence A/ — u is a factor (on the left) of/(A) —fi(u), and we can therefore 
Wlite f(A)-Uu) = (AI-u)Q(A), 

where Q(A) is a certain A-matrix. Accordingly Mu) is the remainder 
when /(A) is divided on the left by A/ —u. 

Similarly, it can be shown that when /(A) is divided on the right by 
A I—u the remainder is 

Mu) = A 0 u N +Aj^u 1 *- 1 + ...+A N . 

An important application of these remainder theorems is given in 
§3*7. 


* See §2 03 of Ref. 7. 
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3*5 v THE DETERMINANTAL EQUATION 

3*5. The Determinantal Equation and the Adjoint of a 
Lambda-Matrix. Let A(A) denote the determinant of a A-matrix /(A) 
of order m, so that 

A(A)a /u(A) U A) ... / lm (A) . 

/21(A) /22(A) ... f 2m (A) 

/ml(A) / m2 (A) ... / mm (A) 

Then A(A) = 0 is termed the determinantal equation, and its roots are 
denoted by A 1( A 2 ,..., A n , where n is the degree of A(A) in A. These roots 
are not necessarily all distinct, but it is convenient to specify them by 
n distinct symbols. 

The adjoint F( A) of/(A) is itself a A-matrix. It has the properties 
/(A)J’(A) = J'(A)/(A) = /A(A), V 

and differentiation p times with respect to A gives 
d p d? <*>> 

.in 

The products in (1) can if necessary be expanded in terms of F(A) and 

( 1 ) ( 2 ) 

the derived adjoint matrices F(A), F( A), etc. 

The following theorems are of importance: 

(A) The matrix /(A s ) obtained by substitution of any root \ 8 for A 
in /(A) is necessarily singular. When A s is an unrepeated root, /(A s ) is 
necessarily simply degenerate. 

(B) When/( A 8 ) has degeneracy q , at least q of the roots A x , A 2 , 

A n are equal to A s . 

(C) The matrix f(A s ) is not necessarily multiply degenerate when A s 
is a multiple root. 

(D) When f(A s ) is simply degenerate, the adjoint F(A 8 ) is a matrix 
of unit rank and is expressible as a product of the form 

F(Ag) = •••> ^2 •*•> ^ms\ = 

where k u , K is are constants appropriate to the selected root, and at 
least one constant of each type is not zero For conciseness the fore¬ 
going relations will often be written 

-*• K K 8‘ 

When this abbreviated notation is used the column matrix k 8 and the 
row matrix k 8 will be understood to be appropriate to the root A,. 
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(E) When f(X 8 ) has degeneracy q, where q> 1, the adjoint matrix 

2) 

jF(A) and its derivatives up to and including F(X) at least are all 
null for A = A g . 

Theorems (A) and (B) follow at once from the results given in 
(i) 

example (i) of § 1*8. Since A(AJ + 0* for an unrepeated root A*, and since 

(i) 

A(A) is linear and homogeneous in the first minors of A(A), these minors 
cannot all be zero for A = A s . Hence/(A S ) is simply degenerate (Theorem 
A). When /(A s ) has degeneracy q } so that all the (q — l)th minors of /(A) 
vanish when A = A a , then all the derivatives of A(A) up to and including 
<«-D 

A(A) are zero for A = A s . This implies that A g is at least a g-fold root 
of A(A) (Theorem B). 

u> 

With regard to Theorem (C) it is obvious that A(A), for instance, can 
vanish for A = A s without all the first minors in A(A) being zero 

Theorem (D) is easily proved on reference to § 1*12. For if/(A a ) is 
simply degenerate, F(X a ) by definition cannot be null. Hence, since 
the product f(X 8 ) F(X 8 ) —which equals A(A g ) I —is null, F( A g ) must be 
a matrix of unit rank. It is therefore expressible as a product of the 
form stated (see example (iv) of § 1-9). 

Lastly, Theorem (E) is obvious from the results given in example 

(ii) of § 1-8, which show at once that if f(X s ) has degeneracy q then 
(A — AJ® -1 at least is a factor of every first minor of A(A), and is thus a 
factor of F( A). Clearly in this case the derived adjoint matrices up to 

(a-2) 

and including F(A) at least are null when A = X 8 . 


(i) Distinct Roots . If 


Examples 


/(A) = 


"A— 1 
0 
0 


0 

A-2 

0 


0 1’ 
0 

A —3 


then A(A) = (A-1) (A —2) (A-3), so that X t = 1, A 2 = 2, A s = 3. Here 
/(A) is simply degenerate for each root (Theorem A). The adjoint, for 
a general value of A, is 


F(A) = 


(A —2)(A —3) 0 0 

0 (A —3) (A — 1) 0 

0 0 (A-l)(A-2). 


* 


A(A,) is an abbreviation for 
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^(A x ) = p 0 01 ={2,0,0} [1,0,0], 

0 0 0 

0 0 0. 

Similarly, F( A 2 ) = {0,1,0} [0, -1, 0], 

F( A 3 ) = {0,0,2}[0, 0,1]. 

These results illustrate Theorem (D). 

(ii) Repeated Roots and Simple Degeneracy. Let 

/(A)= rA -2 -1 O'. 

0 A —2 0 

0 0 A —3. 

Then A(A) = (A - 2) 2 (A - 3), giving A x = 2, A 2 = 2, A 3 = 3, and for each 
root /(A) is simply degenerate. The adjoint is 

J’(A) = T(A —2)(A—3) (A —3) 0 1, 

0 (A-3) (A-2) 0 

0 0 (A—2) 2 

so that F{ A x ) = F(A 2 ) = TO -1 0] {1,0,0}[0, -1, 0]. 

0 0 0 

0 0 0. 

The results exemplify Theorems (C) and (D). 

(iii) Repeated Roots and Multiple Degeneracy. Suppose 

/(A) = rA-2 0 0 ], 

0 A — 2 0 

0 0 A —3. 


which again yields A(A) = (A — 2) 2 (A — 3) and A x = 2, A 2 = 2, A 3 = 3. 
However, in this case /(A 3 ) has degeneracy 2 and the adjoint F(A 2 ) is 
obviously null. This exemplifies Theorems (B) and (E). 

As a further simple illustration assume 


fW — 


'1 

0 

0 

O' 

0 

A 

0 

0 

0 

0 

A 2 

0 

.0 

0 

0 

A 3 . 
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Then A(A) = A*, and/(A) has degeneracy 3 for the sextuple root A = 0. 
With A left general the adjoint is 


jF(A) = A 3 


A* 

0 

0 

L0 


0 

A* 

0 

0 


O' 

0 

0 

1J 


dF d 2 F 

so that F( A), ^ and are all null when A = 0. 


3*6. The Characteristic Matrix of a Square Matrix and the 
Latent Roots. If u is a given square matrix of order n with constant 
elements, then the matrix 




•(1) 


is called the characteristic matrix of u. This very simple and important 
type of A-matiix has a natural origin in applications of linear sub¬ 
stitutions relating to the conditions under which a set of transformed 
variables y is proportional to the original set of variables x. If the 
linear substitution is given by y — ux, and if this relation is to be 
satisfied by, say, y = Ar, where A is a scalar factor of proportionality, 
then wo require (A/ -«)*=/{ A) a; = 0. .(2) 

In order that this equation may be satisfied by a column x which is not 
null, it is necessary that 

A(A) = | \I—u \ = 0. .(3) 

The determinant A(A) in this special case is usually referred to as the 
characteristic function of the matrix u, and A(A) = 0 is the characteristic 
equation . Moreover, the n roots A x , A 2 , A n of A(A) = 0 are called 
the latent roots* of u. Theorems (A) to (E) of §3-5 are, of course, 
directly applicable to the characteristic matrix /(A) and its adjoint 
F(A). 

Suppose now that the latent roots have been found, and let a solution 
of (2) appropriate to the typical root A 8 be denoted by 

Such a column of quantities will for convenience be spoken of as a 
modal column appropriate to A g . 

Now equation (2) requires that 


(KI-u) X (\ 8 )^f(\ 8 )x(\ 8 ) = <). .(4) 


* Sometimes also called the characteristic numbers of u. 
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If /(A,) has degeneracy q, this equation will have q linearly distinct 
solutions (see § 1-9). 

Firstly, assume A s to be a simple root. Then by Theorems (A) and 
(D) of § 3-5/( A a ) is simply degenerate and F( A s ) is a matrix product of 
the type k a K s in which neither the column k a nor the row K a is null. 
But since/(A,) F( A a ) = F(A s )/(A g ) = 0, it follows that f(A a ) k e K a = 0 and 
k a Kgf(A a ) = 0, and these obviously require that 


/(A»)*, = 0, .(5) 

*J(K) = 0. .(6) 


On comparison of (4) and (5) we see that the modal column x(\ a ) can 
here be taken proportional to any non-zero column of the adjoint 
F(\ a ). 

Next suppose A 8 to be one of a set of 8 equal latent roots. Then if 
/(A 8 ) is simply degenerate the modal column can again be chosen pro¬ 
portional to a non-vanishing column k 8 of the adjoint, and this will be 
the only solution corresponding to the whole set of 8 equal roots. If, 
on the other hand, /(A 8 ) has degeneracy q > 1, the adjoint F( A 8 ) is null 
by Theorem (E). Theg linearly distinct solutions must then be obtained 
from columns of such of the derived adjoint matrices as are not null. 
To illustrate the method suppose /(AJ to have degeneracy a, in which 

(•-2) 

case the derived adjoint matrices up to and including i^(A) are all 
null for A = A 8 . Putting p = a — 1 in (3-5*1), expanding the first pro¬ 
duct, and substituting A = A 8 , we have 

<*-i) 

/(A a )^(A*) = 0. 

( 9 - 1 ) 

It can be shown that the matrix F(X a ) is in this case of rank 8 and 
therefore a matrix product of the type fe a x 8 , where the 8 columns of 
the rectangular matrix k 8 and the 8 rows of x 8 are all distinct (see § 1*9). 
The s columns of k 8 can be chosen as the required modal columns. 

In the important special case where all the latent roots of u are 
distinct, there is a distinct modal column k 8 and a corresponding row 
k 8 appropriate to each root. The n modal columns, taken for all the roots 
A x , A 2 ,A^, form a square matrix 

k=z k n k 12 ... k ln , 

k 2 i k 22 ... k 2n I 

-^nl ^n2 •■* ^»n- 


( 7 ) 
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which will be termed fche modal matrix . Since the modal columns can 
be taken in arbitrary multiples, the matrix k is indeterminate to the 
extent that it can always be postmultiplied by a non-singular diagonal 
matrix of constants. It can be shown without difficulty that k is 
necessarily non-singular. 

In the same way the n rows tc 8 can be arranged to form a square 

m * tri ** - - .( 8 ) 


K~ 

*11 

*21 

• • • *nl 


*12 

*22 

... K n2 


-*1» 

*2n 

• • • *nn_ 


By means of the modal matrix the n matrix equations (A* I— u)k 8 = 0, 
taken for all the latent roots, can be combined into the single matrix 
equation 


uk = 

Ai&n 

^- 2^12 

^n^ln 


Ai& 2 i 

^•2^22 ••• 

A n ^2 n 


-Ai&nl 

A2 k n2 



This can be written concisely as 




uk — kA, 


where 

As 

0 ... 

°1 



0 A 2 

0 



.0 0 ... 

A„. 


■•■(») 

.( 10 ) 


denotes the diagonal matrix of the latent roots. Since k is non-singular, 
equation (9) gives # = kAk - K . (11) 


When two square 
of the type 


u = pvp- 1 , 


where p is a non-singular square matrix, they are said to be connected 
by a collineatory transformation . Equation (11) states that a square 
matrix which has all its latent roots distinct is reducible by a col¬ 
lineatory transformation to the diagonal matrix of its latent roots. 
This particular transformation is often very useful. For instance, 
when u is expressed in the form (11), 


it 2 = kAk-'kAk- 1 = kA 2 k~K 


* For convenience the second suffix of the elements k u and k u indicates the particular 
latent root with which the elements are associated. With this convention the suffices in 
the square matrix k appear transposed. 










3-6 
Similarly, 


COLLINEATORY TRANSFORMATION 

I*" 1 , 


u m = kA m k ~ 1 = k 

0 ... O' 

0 A? ... 0 



. 0 0 ... A£. 

and more generally, if P(u) is any polynomial of u 9 

P(u) = kP(A) k- 1 = k 

Pfri) 

0 

0 ... 0 

P(A 2 ) ... 0 


_ 0 

0 ... P(AJ 


Jr 1 . 
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.( 12 ) 


In particular, we may choose P(A) to be the characteristic function 
A(A) == | A/ — u |. Then, since A(A) vanishes when A is any latent root, 
equation (12) reduces to A(u) = 0. The matrix u thus has the im¬ 
portant property that it satisfies its own characteristic equation. 
In §3*7 it will be shown that this holds generally for any square 
matrix. 

If there are s roots equal to A fl , and if f(\ 8 ) is simply degenerate, 
equation (4) yields only a single modal column corresponding to the 
whole set of 8 roots. It is not then possible to reduce u to the diagonal 
form. However, the reduction to the form (11) still holds good if for 
every set of s equal roots A g the degeneracy of /(A s ) is s , i.e. if /(A g ) has 
“full degeneracy” for every set of repeated roots. For in that case 
there are 8 distinct modal columns for each set of repeated roots— 
namely, those of the (ni,8) matrix k 8 —and the modal matrix k is 
composed, as before, of the n modal columns taken for all the latent 
roots. Similarly, there is a matrix k composed of the rows of the 
matrices x g . 




Examples 
/ (iy Latent Roots all Distinct . Suppose 

f2 -2 31. 


1 1 
1 3 

Then the characteristic matrix is 


/(A) = A/—m = 


A —2 
-1 
-1 


2 

A— 1 
-3 


- 31 . 
-1 
A+l. 
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The characteristic equation is 

A—2 2 -3 = A 3 —2A 2 —5A+6 = (A—1)(A+2)(A—3) = 0. 

-I A-1 -1 

-1 -3 A+l 

The latent roots are therefore A 2 = 1, A 2 = — 2, A a = 3. 

When A = Aj in /(A), we have 

/(A 1 )=r-1 2-3] with J’(A 1 )=r-3 5-2". 

-10-1 3-5 2 

.-1 -3 2} 3 -5 2 


Hence we may choose = { — 1,1,1}. 

Similarly k 2 = {11, 1, —14} and k z = (1,1,1}. The modal matrix 
may accordingly be taken as 


1 

11 

1] , giving k~ x = ^0 

-15 

25 

-10 

1 

1 

1 

0 

2 

-2 

1 

-14 

lj 


3 

12 


Equation (11) then is 
T2 -2 3]=&r-l 


2 -2 

1 1 
1 3 


i ii i] ri o r-15 
1 110-20 0 
1—14 lj Lo 0 3j L 15 


25 -10 
2 -2 
3 12 


(ii) Repeated Latent Roots and Simple Degeneracy. If 

u = [" 2 2 3] , 

10 -4 5 
5-4 6 


then 

A(A) = A —2 
-10 


2 -3 

A+4 -6 

4 A —6 


= A 3 —4A a +5A —2 = (A— 1) 2 (A—2). 


The latent roots are A x = 1, A 2 = 1, A 3 = 2. For the repeated root, 
/(A 2 ) = f “ 1 2-3], 

-10 5 -5 
-5 4 -5, 

which is simply degenerate. The adjoint is 

F(A 2 )= f -5 -2 5], 


‘ -5 

-2 

5' 

-25 

-10 

25 

.-15 

-6 

15. 


and this yields the single modal column {1,5,3} for the two roots 
A x , A 2 . In this case the matrix u is not reducible to the diagonal form. 
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3*6 




(iii) Repeated Latent Roots and Multiple Degeneracy. Let 


so that 
A(A) = 


u = [ 7 4-1 

4 7-1 

-4 -4 4. 


= A 8 -18A 2 + 81A -108 = (A - 3) 8 (A -12). 


A-7 -4 1 

-4 A-7 1 

4 4 A —4 

Denoting the latent roots as A x = A 2 = 3 and A 3 = 12, we obtain 

/(A.) = 



which has degeneracy 2. The adjoint F( A 2 ) is null, but 

-5 4 -1‘ 

4 -5 -1 
L-4 -4 -8. 


(i) 

W = 


-5 41 ri 0 

4 -6 |o 1 

_4 — 4_| 


a- 


Hence we may choose the two modal columns corresponding to the 
double root to be k x = {5, — 4, 4} and k a — {— 4, 6, 4}. The modal 
column appropriate to the simple root A 3 is found to be {— 1, — 1, 1}, 
so that 


k = 

' 5 -4 -1" 

, giving k- 1 = $T 1 0 11 


-4 5 -1 

Oil 


.441. 

i“4 

l 

1 


Accordingly, 


" 7 

4 

-r 

= i 

' 5 

-4 

-1“ 

'3 

0 

0‘ 

’ 1 

0 

1* 

4 

7 

-l 


— 4 

5 

-1 

0 

3 

0 

0 

1 

1 

. — 4 

-4 

4 


4 

4 

1. 

0 

0 

12. 

.-4 

-4 

1. 


(ivj Latent Roots of Matrices Connected by a CoUineatory Trans¬ 
formation. If u = pvp~ l , then A I—u= p(XI—v)p~ 1 . Hence 

| A/—«| = | A/—1> |, 

so that u and v have the same latent roots. When u is reducible to the 
diagonal form kAkr 1 we can write v = p-'kAk^p. The modal matrix 
of v then is p~ x k. 

(v) Latent Roots and Modal Matrix of P(u). Applying the theorem 
of example (iv) to the transformation (12), we see that when u is 
reducible to the diagonal form, P(A X ), P(A 2 ), ..., P(A„) are the latent 
roots of P(w), and that u and P{u) have the same modal matrix k. 
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The matrix P(u) can have repeated latent roots even when A 1} A 2 , 
\ n are all distinct. In this case if P(u) has s latent roots equal to 
then its characteristic matrix will have degeneracy s for the root A s . 


3-7. The Cayley-Hamilton Theorem. In § 3-6 it was incident¬ 
ally shown that a square matrix whose latent roots are all distinct 
satisfies its own characteristic equation. The Cayley-Hamilton theorem 
states that this property holds good generally for any square matrix u . 
Thus 


A (u) = 0. 


.( 1 ) 


Since A(A) = (— 1 ) w (A x — A) (A 2 — A)... (X n — A), the theorem may also 
be written in the factorised form 


(X 1 I-u) (A 2 /-u)...(A n /-w) = 0, 
or /(A 1 )/(A 2 ).../(AJ = 0. 

The proposition follows immediately from the remainder theorems 
of § 3*4. For, if F(X) is the adjoint of the characteristic matrix, then 

/(A) F(X) = A(A) I. .(2) 

Now A(A) I is a A-matrix, and the last equation shows that it has zero 
remainder when divided on the left by XI — u. Hence by the remainder 
theorem A (u) = 0. 

In the foregoing proof no restriction has been placed on the nature 
of the latent roots. The theorem thus holds for matrices with repeated 
latent roots. However, if there are any repeated roots for which /(A) 
is multiply degenerate, the theorem admits some extension. Suppose 
all the first minors in /(A) to have a common factor 0(A). This will be 
a factor of A(A) so that we may write A(A) = 0(A) A^A), and 0(A) will 
evidently also be a factor of all the elements of F( A), so that, say, 
F( A) = 0(A) F p ( A). Hence in place of equation (2) we can now use 

f(\)F p (\) = \(\)I. 

By the same argument as before it follows that 

A» = 0. .(3) 

In the special case where 0(A) is chosen to be the highest common 
factor of the first minors of A(A), written with unity as the coefficient 
of the highest power of A, then A p (A) = A(A)/0(A) is spoken of as the 
reduced characteristic function. It can be shown* that A p (u) then is the 
vanishing polynomial of u of lowest possible degree. 

* See, for instance, Chap, v of Ref. 2. 
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Examples 

(i) Latent Roots all Distinct (see example (i) of § 3-6). If 


u = T2 

-2 

3" 

1 

1 

1 

.1 

3 

- 1 . 


then A(A) = A 3 - 2A 2 - 5A + 6 = (A -1) (A + 2) (A - 3). 

Hence v? — 2 u 2 — 5u + 6/ = 0. 

The factorised form of the theorem is 

(I-u)(-2l-u)(ZI-u) = 0, 
which yields the identity 


-2 3] [4 

0 1 1 
3 -2 1 


4-2 3ir-l “2 


1 -2 1 
1 3 — 4 


r°- 


(ii) Repeated Latent Roots and Simple Degeneracy (see example (ii) 

•> f §3"» W <t- f 2 -2 31, 

10 -4 5 

.5-4 6 

then A(A) = A 3 -4A 2 + 5A-2 = (A- 1) 2 (A- 2). 

Hence u z — 4u 2 + 5u — 21 = 0. 

In this case/(A) is simply degenerate for the repeated root A == 1, and 
the ordinary characteristic function is the vanishing polynomial of 
lowest degree. 

(iii) Repeated Latent Roots and Multiple Degeneracy (see example (iii) 

of § 3-6). Let ... r , , 


I 


so that 


7 4-1 

4 7-1 

-4 -4 4 


A(A) = A-7, -4, 1 = A 3 -18A 2 + 81A-1 

-4, A-7, 1 

4, 4, A —4 

The normal form of the theorem gives 

u 3 — 18w 2 + 81m — 108/ = 0. 


A 3 — 18A 2 + 8lA —108 = (A —3) 2 (A -12). 
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However, /(A) has degeneracy 2 for the double latent root A = 3. 
Extracting from A(A) the common factor (A - 3) of the first minors, we 
obtain the reduced characteristic function 

A p (A) = (A - 3) (A —12) = A 2 — 15A 4- 36. 

Hence the reduced form of the Cayley-Hamilton theorem gives 
u 2 — 15^-f 36/ = 0, or alternatively 

(31 —u) (I2l — u) = [-4 -4 lir 5-4 11=0. 

— 4 -4 1 -4 5 1 

.4 4 - lj L 4 4 8 

(iv) Computation of Positive Powers of a Matrix . Since A (u) = 0, 
we have uA(u) = 0, u 2 A(u) = 0, and so on. By successive applications 
of these equations any positive integral power of u is linearly expressible 
in terms of the unit matrix and of the first n — 1 powers of u, where n 
is the order of u. 

For instance, suppose u to be as for example (i). Then 
u z = 2 u 2 + 5u -6J, 
u A — 2u 2 4- 5u 2 — 6u = 9u 2 + 4u— 127, 
u 6 = 2u A -f 5u B — 6u 2 = 22 u 2 *f 33 u — 54/, 

and so on. 

(v) Computation of Negative Powers of a Matrix. If u is not singular 
the method of example (iv) can also be used to evaluate the negative 
powers, and in particular, the reciprocal, of a matrix. Thus, if u is as 
for example (i), 

6 u~ x as —u 2 + 2u + 5/, 

36 u~ 2 = -6u+12l + 30u~ l = -&u 2 + 4u + 37l, 

and so on. 

(vi) Reduction of Polynomials. Since any positive integral power of 
u is linearly expressible in terms of I and of the first n— 1 powers of u, 
so also is any polynomial P(u). Thus 

P(u) = u4 0 u n ~ 1 +^i^ w - a + ...+-4 n ^ 2 ^+-4 n _i/, 

where the coefficients A { are scalars. 

By a similar process the corresponding scalar polynomial P(A^) is 
reducible to 

P(\) = ^ 0 A?~ 1 +^ 1 A?“ 2 + ...+^i n _ 2 A r +^ 1 . 

However, this expression would not be valid for a general value of A. 
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The argument also applies for expressions P(u) involving negative 
powers, provided that u, is not singular, 

(vii) Linear Difference Equation Satisfied by the Powers of a Matrix. 
If the characteristic equation is 

A(A) =* A w A w_1 -f... +p n = 0, 

then u*&(u) = 0. Hence if Y(s)==u 8+n , the sequence of matrices Y(s), 
Y(s — 1), etc. satisfies the difference equation 

7(5)+^! Y(s- 1 ) + ...+p n Y(s-n) = 0. 

This implies that a sequence formed by any corresponding elements 
taken from Y(s), Y(s — 1), etc. also satisfies the difference equation. 
Let E 8 denote the value in Y(s) of the selected element. Now it is 
known from the theory of difference equations that 

E 8 =Z e { Al .( 4 ) 

i= 1 

where the coefficients e i are constants and A*, A 2 ,..., A n are the roots 
of A(A) = 0, here assumed all distinct. Hence also 

Y 8 = S e,A5, 

i=i 

where the coefficients e t are matrices of constants. The last result is 
a particular case of Sylvester’s theorem (see § 3*9). 


3*8. The Adjoint and Derived Adjoints of the Character¬ 
istic Matrix. Simple expressions for the adjoint and derived adjoints 
of the characteristic matrix will now be obtained. 

Firstly, it will be convenient to construct certain identities in¬ 
volving merely scalars. Write the expanded form of A(A) == | /(A) | as 

A(A) = A w -f p 1 A w “ 1 -f-p 2 '^ n ~ 2 *h ••• +Pn-l^+Pn = II (A —Ay). 

i-i 


Then identically . v 

^■~ A ^ = A n ~* + {x+Pi) A n ~ 2 + • • • 

+ (a:” -1 +p x x n -*+... i) 

= 0(A,a;), say. .(2). 

If x = A,, where A,, is a typical simple root of A(A) = 0, the preceding 
formula gives ^ ^ = A(A)/(A _ At) = n (A _A,)..(3) 

i+r 
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Next, suppose A(A) = 0 to have two equal roots, say A x = A a . On 
differentiation of (2) with respect to x, we obtain 


3 . n _v A(A)-A(a:) A(x) 

(A—as) a 

Hence when x = A a = A x , 

' 9^^ (A ’ Aa) = = (A “ As) (A ~ A 4 ).. • (A -AJ, ...(4) 

( 1 ) 

since A(A 2 ) and A(A 2 ) both vanish. 

More generally, suppose there to be a set of 8 equal roots A 1? A 2 ,..., A a> 

so that A(A a ) = 0 for i = 0,1 ,8 — 1. Differentiating (2) p times with 
respect to x , where p ^ 8 — 1, and substituting x — A a , we obtain 

= (A-A,)‘-*- 1 (A-A, +1 )...(A-A n ). 

.( 6 ) 

The formulae (3) and (5) are scalar identities which are true for all 
values of A, and when A is replaced by the square matrix u corresponding 
matrix identities are derived. Thus (2) and (3) yield 

= ^ n “ 1 + (A r +p 1 )w n “ 2 + ... + (A?“ 1 +^ 1 A”“ 2 +...+jp n ^i)/ 

= IK^-A,/), 

*+r 

while, if there is a set of 8 equal latent roots, equation (5) gives 

4 = ( w -a* ty-v-Hu - Vi /)... (« - A^i), 


.( 6 ) 


in which 0 < s — 1. 

Now replace A by u and x by XI in the identity (2). Then, since 
A(w) =» 0, we obtain 

(XI - u) <f>(u, A) = A(A) I. 

This shows that <p(u, A) is the adjoint matrix of/(A), for general values 
of A. Hence 


F( A) = <f>(u, A) = « n - 1 + (A+p 1 )«* -a +... 

+ (A"- 1 +p 1 A n ~* +... +p n _ 1 ) I. 

In particular, when A = A T , we have 

F(K) = 0(«, A,) - (- l)“-i n (A ,I-u). 

i+r 


•(7) 


( 8 ) 
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More generally, if there are 8 equal roots we obtain from equation (6) 

= ••.(«- KD, 

for —1. 

The results may be summarised as follows: If F(X) is the adjoint 
of the characteristic matrix /(A), then for any latent root X, 


= .(9) 

i+r 

while for any set of 8 equal roots X v A 2 ,..., X 8 , and for — 1, 

= (-l)»-p-i[/(A 4 )]«^-V(A, + 0/(A s+2 ).../( A n ), 

.( 10 ) 


(P) 

in which F(X 8 ) is, as usual, an abbreviation for 



Examples 


(i) Latent Roots all Distinct (see example (i) of § 3*6). Suppose 



with Aj = 1, A 2 = — 2, A s = 3, 


and 


Then, by (9), 




'A —2, 2, —3 ‘ 

-1, A— 1, -1 
. -1, -3, A + l. 


E(l) = (-l)V(-2)/(3) = 






6 

-5 

-5 



F(-2)=f(l)f(3) = 


B 


0 


11 - 11 * 1 , 
1 -1 
-14 14. 


m =/(!)/(- 2 ) - 


“5 

5 

5 


1 4 1* 
1 4 

1 4 


Similarly, 
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(ii) Repeated Latent Roots and Simple Degeneracy (see example (ii) 
of §3-6). Let 

m = T 2 -2 31, so that A x = 1, A 2 = 1, A, = 2, 

10 -4 5 
. 5 —4 6 

and /(A) = [A-2, 2, -31. 


*-2, 2, -31. 

—10,A + 4, -5 
-5, 4, A-6- 


By (9), 


J’(l) = (-l)V(l)/(2)= r -5 -2 5 

- 25 -10 25 
-15 -6 15 


while (10) gives, for n = 3, s = 2, p = 1, 

1 (1 ) 0 — 2 31 

^(1) = (-l>/(2) = 10 -0 5 

5 -4 4. 

(i) 

We may note that F(l) and JP(1) here jointly contain only two linearly 
independent columns. Thus F(l) is of unit rank and yields only one 

distinct column, proportional to (1,6,3}. Again, the first column of 
(X) 

F(l) is proportional to {0,2,1}, and the property to be particularly 

(i) 

noted is that the remaining two columns of F(l) are linear combina¬ 
tions of {1,5,3} and {0,2,1}. Thus 

{_ 2> -0, -4} = — 2{1, 5, 3} + 2{0, 2, 1}, 

{3,5,4}= 3{1, 5, 3} —5{0, 2, 1}. 

The general rule is that if A, is a latent root of multiplicity s, then s 
(and only s) linearly distinct non-vanishing columns will be included 

(i) <»-i) 

amongst the a matrices F(X t ), F (\„),..., F( A,). 

(iii) Repeated Latent Roots and Multiple Degeneracy (see example 
(iii) of §3-0). Take 

u = I" 7 4-11, with A x = 3, Aj = 3, A 8 = 12, 

4 7 -1 

-4 —4 4 J 

/(A)= TA-7, -4, 1‘. 

-4, A-7, 1 
L 4, 4, A-4j 


and 
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Then 

^(3) = (-l) a /(3)/(12) = 0; F{ 3) - ( —1)/(12) = f-5 4—1' 

4 -5 -1 
— 4 -4 -8 

Here the adjoint F(3) corresponding to the double root is null, while 
(i) 

F( 3) has only two linearly independent columns. 

( p) 

(iv) Matrices F( A) and /(A) PermtUable. Since in the case of the 

characteristic matrix —■ = / while ~~ = 0 if p> 1, equation (3*5-1) 
oA q>AP 

( P) (P-1) <p) (P-1) (p) 

gives /(A).F(A)+jplP(A) = i*(A)/(A)+|»J(A) = 7A(A). 

(p) (p) 

Hence /(A) F(\) = JF(A)/(A). 

(y) Properties of Modal Columns . In (3*5*1) put p = 1 and A = A,., 
where A r is an unrepeated latent root. Then 

S(K)F(K)+T(K) = $K)I, 

which, on premultiplication by F(\) and use of the relation 

nK)f(K) = 0 , 

yields f*(K) = $K)F(K)- 

In the notation of § 3-5 

= k r K r k r K r = K r k r F( 

since K r k r is a scalar. Hence (1) 

K r k, = A(A r ). 

Again, from (9) it iB obvious that F( A p ) F( A ff ) = 0 provided that A, 
and Ag do not belong to a common set of equal roots. In particular, if 
Ap and A, are both simple roots 

= 0 . .( 11 ) 

These results show that when the latent roots are all distinct the 
modal matrix k and the matrix k of § 3-6 are connected by the relation 

Kk = d ’ ° r k- 1 = d~ l K, .(12) 

(1) 

where d is the diagonal matrix of the quantities A(A r ). If u, and there¬ 
fore every adjoint J 7 (A r ), is symmetrical we can choose K r — k' r , so that 
k = k' . In this particular case k’k = d, and the modal matrix thus has 
a generalised orthogonal property (compare § 1*17). 
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(vi) Partial Fractions Formulae . In the present example use it made 
of the familiar method of resolution of a rational fraction into partial 
fractions. 

For simplicity all the latent roots of u are assumed distinct. Then 
since (A I—u) F( A) = A(A) /, we may write 

(A/— u)- 1 = ss_ ESQ _ 

1 U) A(A) (A—A x ) (A—A 2 )... (A —A n )' 

Now every element of the matrix F(A)/A(A) is a rational fraction, which 
can be expressed by the methods of ordinary algebra in simple partial 
fractions. Hence, noting that the elements of F( A) are of degree n—1 
at most in A while A(A) has degree n, we may assume for the complete 
matrix an expansion of the form 

F(A) _ A 1 A 2 A n 

A(A) ~A-A 1 + A-A 2 +, “ t A-A n ’ 

The coefficients A lf A 2 , etc. are determinable by the usual methods. 
Thus to find A ri multiply the equations throughout by (A — Ay) and 
then substitute A = Ay. This gives 

(i) 

A r = F( Ay)/A(Ay). 

Hence we derive the identity 

(A /-*)-».(is) 

r - 1 A(A r )(A-A r ) 

This result is also deducible from Sylvester’s theorem (see §3-9). The 
matrix (A I—u)~ l is usually called the resolvent of u . 


3-9. Sylvester’s Theorem. This important theorem states that 
if the n latent roots of u are all distinct and P(u) is any polynomial of 
u, then n 

P{u) = 2 P(K) UK), .(1) 

r-1 


( 1 ) 

where Z^)mF{K)MK )- n (A,/-»)/II 

j+r j+r 


U) 

and A(A,) denotes 



The matrices Z which appear as coefficients of the scalar polynomials 
P(A,) are here given a cipher suffix in order to distinguish them from 
the more general coefficients which are introduced when repeated 
latent roots occur (see § 3-10). 
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To prove the theorem* we note firstly (see example (vi) of §3-7) 
that P(u) and P(A r ) are reducible to the similar forms 

P(u) = A 0 w n_1 + AiU n ~* +... + A n _ x I, 

P(K) — -4o^r _1 + -^l^r - 2 +*"+^n-l- 

Now use formula (2-3-5), with A, substituted for a r . This formula 
becomes applicable since P(«) is here expressed as an equivalent 
polynomial of degree not exceeding n — 1, and it immediately gives (1). 
The theorem is also valid if P(w) contains negative integral powers of 


u, provided that u is not singular, f 
The matrices Z 0 have the properties 

Z 0 (A r )Z 0 (A p ) = 0 ifr+p, .(2) 

and* Z 0 -( A,) = Z 0 (A,), .(3) 

where m is any positive integer. Also 

S Zo(K) - 1- .(4) 

r—1 


Equations (2) and (3) are obtained from results given in example (v) 
of §3-8, while (4) follows on substitution of / for P(u) in (1). 


Examples 


(i) Third Order Matrix. Suppose 


2 -2 


1 1 
1 3 



with A t = 1, A 2 = — 2, A a = 3. 


Prom the results of example (i), § 3-8, 

P(l) = {-1, 1, 1}[3, —5, 2]; 
P(-2) = {11, 1, —14} (0, 1,-1]; 
P(3) = {1, 1, 1} [5, 1, 4]; 


(i) 

A(l) = -6; 
(i) 

A(-2) = 15; 
(i) 

A(3) = 10. 


Hence 30 P(u) = P(l){-1, 1, 1} [-15, 25, -10] 

+ P( —2){11, 1,-14}[0, 2,-2] 
+ P(3){1, 1, 1} [16, 3, 12], 


(5) 


* An alternative proof can be based on the identity (3*6-12) and the relation (3*8*12). 
t Note that if u is not singular no latent root can be zero. 

J On account of the property (3) the matrices Z 0 are said to be idempotent . 
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which can be written 


P(u) = * 

-1 

11 

r 

r^(i) 

0 

0 ‘ 

-15 

25 

- 10 ' 


1 

1 

l 

0 

P( —2) 

0 

0 

2 

-2 


1 

-14 

l. 

0 

0 

m 

15 

3 

12 


This should be compared with the last equation in example (i) of § 3*6. 
If P(u) = u m , and m is large, equation (5) gives the approximation 


*{1,1,1} [16,9,12] 8". 

Here A 3 (= 3) is the dominant latent root, namely, the root having the 
largest modulus. A similar approximation for a high power of a matrix 
holds good in all cases where there is a single dominant root. 


(ii) Fourth Order Matrix. Suppose 


u = 




In this case A(A) = |AZ— u\ = (A 2 — 1)(A 2 —4), giving the latent roots 
A x = 1; A s = — 1; A 8 = 2; A t = — 2. The corresponding adjoint matrices 
(i) 

and values of A(A) are 

(i) 

*(1) = -6{-1, -1, 2, 2}[1, 0, 2, -1]; A(l) = -0; 

(i) 

^(-1) = «{1, 1, - 2, - 3} [0, 0, 1,-1]; A( — 1) - 6; 

(i) 

n 2) - 12{1, o, -1, -1}[3, 1, 2, 0]; A(2) = 12; 

F(- 2) = -12{-1, 1, 1, 1}[1, 1, 1, 0]; A( — 2) - -12. 

Hence P(u) = { - 1 , - 1 , 2, 2} [1, 0, 2, -1] P(l) 

+ {1, 1, — 2, — 3}[0, 0, 1,-1]P(-1) 

+{1. 0,-1,-1} [3, 1, 2, 0] P(2) 

+ {-l, 1, 1, 1}[1, 1, 1, 0]P( —2). 

If P = u m , and m is large, the value of u m is approximately given by 

2!-*»«”* = {1, 0,-1, — 1} [3, 1, 2, 0] + ( —l) m { —1, 1, 1, 1}[1,1, 1, 0]. 

In this case the two latent roots A s and A 4 have equal moduli and are 
dominant. The two corresponding matrices must be retained in the 
approximate expression for u m . 
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(iii) Summation of Infinite Power Series (see § 2*4). Conditions for 
the convergence of an infinite power series in u can be found by the 
use of Sylvester’s theorem. Thus if 

00 

S(u) = E OLiU*, 

i-0 


where the coefficients a are all scalar, it is seen at once that the series 
will converge to a finite limit provided that all the corresponding 
scalar series S(X r ) are convergent. 

As a simple example of the summation of a series by Sylvester’s 
theorem we may assume 


u = 


015 

-0-25 


— 0-011, 
0-15J 


and 

Here 


S(u) = 


/(A) = 


[A-0-15 0-01 1; 

F( A) = TA-0-15 

-001 ] 

[ 0-25 A —0-15J 

[ -0-25 

A — 0-lsJ 


and A(A) = A a - 0-3A+002. 

The latent roots are A x = 0-1 and A, = 0-2, and these give 


with 


F( 01) 


r-o-05 

-0-011; 

JF’(0-2) = r 0-05 

-0-011 

[-0-25 

-o-oej 

[-0-25 

o-osj 


(X) (i) 

A(0-1) = —0-1 and A(0-2) = 01. 


Again 


8 ( 0 - 1 ) 


1 

1-0-1 


10 
9 * 


S( 0-2) = 


1 

1 - 0-2 


10 
8 • 


Hence finally (see also example, § 2-4) 



8 ( 0 - 2 ) 

0-1 


(iv) Fractional Powers of a Matrix (see § 2-2). Sylvester’s theorem 
has only been proved for rational integral functions P(u), but it 
suggests that if, for example, it is a second order square matrix, then 

[AiZ 0 (A 1 ) + ^Z 0 (A g )]a = «. .(6) 

Now by direct multiplication 

M^AJ + AiZofA,)]* 

= A.ZftAJ + AtA|[Z 0 (A 1 ) Z 0 (A 2 ) + Z 0 (A 8 ) Z 0 (A X )]+A, Z*(A,). 
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On account of the properties (2) and (3) the right-hand side of this 
equation reduces to A x Z 0 (X i) -f A 2 Z 0 (A 2 ), i.e. to u. Hence (6) is verified. 

Evidently by the foregoing method four possible square roots of u 
can be obtained, namely ± AfZ 0 (A 1 ) ± A|Z 0 (A 2 ), the signs being here 
associated in all possible combinations. More generally, if u is a square 
matrix of order n with distinct latent roots, then 2 AJ /m Z 0 (A r ) is an mth 
root of u , and since AJ /m has m distinct values and there are n latent 
roots, it will be possible to construct m n such roots. The foregoing 
argument would, of course, require modification if some of the latent 
roots were repeated. 

(v) The Matrices Z 0 as Selective Operators . Suppose we have to 
deal with sets of objects which are drawn from n mutually exclusive 
classes 1,2, ...,w— l,n. Let U represent a set, and U r the subset of XJ 
which belongs to the class r. Then obviously 

U = £u r . .( 7 ) 

r-1 


Again, let S r denote the operation of selecting from U all the objects 
belonging to the class r; then 



s r u = u r . 

.(8) 

Accordingly, 

V = S S r U, 



r= 1 


or 

S Sr = 1 . 

.( 9 ) 


r« 1 


Next, apply S r to both sides of (8). Now clearly S r U r = U ri since U r 


consists entirely of members of the class r. Hence 

S*U = ££ = S r U } 

so that Sf = S r . .(10) 

Further, S p U r = 0 if p + r, for U r contains no members of the class p . 
Apply S p to equation (8). The result is S p S r U = 0, or 

S P S r = 0 (p+r). .(11) 


Operators which possess the properties given by (9), (10) and (11) are 
said to fonnja spectral set * From the present standpoint the interest 
lies in the fact that the matrices Z 0 (A r ) can be regarded as selective 
operators of this type. 


For a simple account see, for instance, Chapter xn of Ref. 8. 
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3-10.* Confluent Form of Sylvester’s Theorem. The theorem 
given in §3-9 requires modification if two or more of the latent roots 
are equal. Before discussing this modification we shall give an alter¬ 
native proof of the theorem for the case of distinct roots. 

Since P(A) / — P{u) is exactly divisible by XI—u, it follows that 

P(u) = P(X)I + Q 1 (X)(XI-u), 

where Qi(X) is a polynomial of A and u which need not be determined. 
This equation may be written 

P(u)(XI-u)~' = ™ + e i( A). 


The first matrix on the right can be resolved into partial fractions, as 
illustrated in example (vi) of § 3-8. Since the degree in A of P(A) F( A) 
will in general exceed that of A(A), the correct form to assume in the 
present case is 

: + QM), 


P(A)P(A) » 

A(A) r=i (A - 


K) 


and the typical coefficient A r is given by 


(i) 


It follows that 


A r = P(X r )F(X r )IA(X r ). 


P(u)(XI-u)~' = £ ( ^- + Q 1 (A ) + Q,(A). 

r ’ 1 A(A r )(A-A r ) 

Evidently, since P{u) is independent of A, the quotient Q x + Q z must 
be zero. Hence n . 

. (!) 

r “ 1 A(A r )(A-A r ) 

Similarly, replacing P(u) by uP(u) we have 


UP(U)(\I-U)-' = £ K PjAWr) , 
r - 1 A(A r )(A-A r ) 


•( 2 ) 


Multiply (1) throughout by A and subtract (2). Then 


n 


P(U) = S 

r=»l 


P(X r )F(X r ) 

(i) » 

A(A r ) 


(3) 


which is the theorem of § 3-9. 

To extend the theorem to cover the case of multiple latent roots, it 
is only necessary to modify appropriately the expansion involving the 
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partial fractions, and it will be enough to consider those terms which 
arise from a typical set of a equal roots A lt A a ,A,. Thus assume 

A 




i+B, 


A(A) ^(A-A.)* 

where R includes the quotient and the partial fractions arising from 
the remaining roots. Then by the usual methods of algebra 


1 

r d- 1 P(A) P(A)“J 

|s-i 

ldX°-< A S (A) J 


_U-A,’ 

where A,(A) s (A - A, +1 ) (A - A s+a ) ... (A-A n ). 

The terms in the right-hand side of the equation corresponding to (1), 
due to the set of equal roots considered, are then 

f A > 

Similarly, those contributed to the equation corresponding to (2) are 

Bt 


$ 

2 


where 


Is — »L< 


a (A-a t r 

dT 1 AP(A) P(A)"1 
dX°~< A,(A) 


Ja-a, 


Hence the terms contributed to the equation corresponding to (3) are 

f. XAt-Bt 


But B t 


<-1 (A—A ,) 1 

: X a A { +A {+1 , so that the last summation reduces to 


v — A«) — Aj+i 

z* /A •» \i 


A v 


<-i (A-A,) 1 

The term in the expression for P(«) due to the roots A, is accordingly 


1 f d°-' P(A) P(A) ~[ 
^LdX'-i A,(A) _| A _ A , 


The confluent form of Sylvester's theorem may thus be stated as 

* _1 P(A)P(A)-| 

a,(A) _L/ -• 


P(u) = ZT(A.) = Z ; 


1 Td- 1 . 
i — 1 l_dA* -1 


•W 


where A,(A)s(A—A m )(A—A l+a )...(A—AJ, and the summation is 
taken for all distinct values A, of the latent roots. 




in which 
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The typical term T(\ a ) is expressible in the expanded form 

(1) (a-l) 

nA.) - f (A.) V 1 (A,) + ^Z_,(A,) +... + ^Z„(A t ), 

ir^ mi 
<(a) “ ^U^alL; 

for * = 0,1,1, and P(A g ) denotes ( -^ A ^ ) 

\ »A l / A—A* 

An alternative is 

(s-l) (1) (*—2) (2) («-3) (f-1) 

F(A g ) P(A g ) F(A g )P(A g ) F(A,)i’(A.) , F(A g )F(A g ) 

1 - j^n + HEI" .117-3" + - + is-i ’ 

«> (0 

where F(A) = P(A)/A g (A) and F(A g ), P(A g ) have meanings similar to 

«> 

P(A g ). On application of (3-8-10) the last equation may be written also 
as (2) 

T(K) = (-i)»-[y(A,)/- r ( (X)/(A,)+ r(A, |f (A ^ +-. 

(»-l) 

+ (-1)*- 1 r(A ^ T 1(A » ) ]/(A, + i)/(A g+ii ).../(A.). 

If /(A,) has degeneracy q> then the highest power of /(A*) present in the 
foregoing expression will be a — q. This also follows from a consideration 
of the reduced characteristic function (see § 3*7). 

Examples 

(i) Matrix with Latent Roots all Equal . If the latent roots of u are 
all equal, so that a = n, then = 1, Y = P(A g ). Hence 

( 2 ) 

P(u) = P(A n )-P(A n )(A„/-«)+^(A n /-«)*+... 


I? 


In particular 


(»-i> 

+ (- 1)n - 1 ^- (A n^-« )n - 1 - 


«» = A^/-mA^- 1 (A n 7-«)+^^- ) A^- 2 (A n /-tt)*+... 

+ (- :1)+ Ay-n + l ( A|g 7 —tt) 1 *- 1 . 
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This result is verified immediately by the identity 

u m =[X n I-(X n I-u)] m 

and an application of the Cayley-Hamilton theorem, which in this 
case gives (A n I — u) n = 0. 


(ii) Sylvester's Expansion for Matrix u itself . If P(u) = u, then 
^ = Si A r Z^Xf) -h S 2 [X 8 Z 8 _ x ( X 8 ) 4- Z 8 _ 2(A s )], .(5) 


where the first summation includes all unrepeated roots and the second 
is taken for all distinct values X 8 of repeated roots. 

In the particular case where the characteristic matrix has full 
degeneracy s for every set of s repeated roots, the derived adjoint 

(a-2) 

matrices up to, and including, F(X S ) are all null (see Theorem (E) of 

(a—1) 

§ 3*5). Hence Z 8 _ 2 (X 8 ) = 0 and Z 8 _ x (X 8 ) = F{X 8 ) / |&— I A 8 (X 8 ). Equation 
(6) may then be written in the notation of § 3-6 


u = S 


AA*a 

I*-1A,(AJ’ 


where the summation is for all distinct values of the latent roots. 
A comparison of this with the equation u — &A& -1 , in which the modal 
matrix k is now composed of all the columns of the matrices k 8 (see 
§ 3*6), readily yields the identity 


Kk = d or k~ l = d^K, .(6) 

where d is a diagonal matrix which is expressible in partitioned 
form as 

J*|a--1 A„(A a ) I a 0 ... ° -I, 

I 0 |6 — 1 Afe(Aft) lb ... 0 I 


L 0 ® ••• Iffn 1 A P (Ap) J p I 

and a, 6,..., p are the multiplicities in the sets of equal roots. Equation 
(6) is a generalisation of (3*8*12). 


(iii) Two Equal Latent Roots and Simple Degeneracy. If 


10 -4 5 
L 5 -4 6j 
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then A(A) = (A- 1) 2 (A —2) and A x = A 2 
form of Sylvester’s theorem gives 


1, A 3 = 2. The confluent 




mmi 
(A-2) J A=1 

(i) 


P(2)F(2) 


( 2 — l) 2 
(i) 

= P(1)[-F(1)-F(1)]-P(1)F(1) + P(2) F(2). 


Using results obtained in example (ii) of § 3-8, we obtain 

P(*) = 


5 

4 

-8 

p{ i)+ 

5 

2 

-5 

15 

16 

-30 


25 

10 

-25 

_10 

10 

-19. 


,15 

6 

-15. 


(i) 

P( 1) 


+ 


-4 -4 

-15 -15 
-10 -10 


8 ' 

30 

20 


P( 2). 


(iv) Two Equal Latent Roots and Multiple Degeneracy. If 


u — 




the latent roots are A x = A 2 = 3, A 3 = 12, and on reference to the 
results of example (iii) of § 3-8, we find 


9P(w) = 

'5-4 1“ 

-4 5 1 

P(3) + 

4 4 -r 

4 4-1 


.448 


.-4 -4 1. 


(i) 

The term involving P(3) is absent since P(3) = 0. 


P( 12). 


Part II. Canonical Forms 

3*11. Elementary Operations on Matrices. An important 
conception relating to matrices is that of equivalence, but before 
defining equivalent matrices we shall consider certain elementary 
operations which are, in fact, extensively used in the reduction of 
determinants. These operations are: 

(I) Interchange of two rows, or of two columns. 

(II) Addition to a row of a multiple of another row, or addition to 
a column of a multiple of another column. 

(Ill) Multiplication of a row or of a column by a non-vanishing 
constant. 
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It will now be shown that the performance of any one of these 
operations upon a square matrix u is equivalent to premultiplication 
or postmultiplication of u by a non-singular matrix. 

Operation of Type I. Let J denote the unit matrix I with the ith 
and the jth rows interchanged. Then it is easy to see that the product 
Ju is the matrix obtained when the ith and jth rows of u are inter¬ 
changed. Similarly uJ is the matrix obtained from u by interchange 
of the ith and jth columns. 

The determinant of J is — 1. 

Examples 

(i) rl 0 Oi r« u u 12 = ptn u lt . 

0 0 1 L ttaa U n U 3t ttjj 

.0 1 OJ Uai «** “sJ “22 «23- 

(ii) ran tt 12 w 13 1 rl 0 01 ruj! . 

«21 “22 y 0 0 1 “21 “23 «22 

“31 “32 “33-1 LO 1 OJ t “81 “33 “s 2 - 

Operation of Type II. Let L be the unit matrix modified by the 
introduction of the element l in the ith row and jth column, where * 
and j are unequal. Then Lu is the matrix obtained from u by addition 
of l times the jth row to the ith row. Similarly uL is the matrix obtained 
from u by addition of l times the ith column to the jth column. 

The determinant of L is 1. 

Examples 

(i) rl 0 Oi r« 11 it 18 u 13 = “n» “ia> “is 

10 1 oil « 81 lt 22 «23 “sd “22> “23 

Lo l lj U 81 « 82 Lw 31 + lu 2V u 32 + lu 3t , Mja -f lu^. 

(ii) run « 12 u l3 l rl 0 01 Vu u , w 12 + l« 13 , m 13 1 . 

I ttjj U 33 “28 II® I ® “ 21 > “22 f ^“231 “23 

L«si «82 “33-1 L° l 1-1 L“S 1 > U 3 ^ + IU 33 , %. 

Operation of Type III. Let H denote the unit matrix with h (+0) 
substituted for unity in the ith element in the principal diagonal. Then 
Hu equals the matrix u with its ith row of elements multiplied by h, 
and uH is the matrix u with its ith column multiplied by h. 

The determinant of If is A. 
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Examples 


(i) 

1 

0 

0 - 

'<*11 

<*12 


<* 13 ~ 

= 

' <*n> 

<* 12 > 

<*13 ’ 


0 

1 

0 

<*21 

<*22 


<*23 


<* 21 > 

<*22 > 

<*23 


.0 

0 

h. 

-<*31 

<*32 


<* 33 - 


_Zw*3i, 

^<*32’ 

A<*33_ 

(ii) 

M n 


U 12 

<*13 

1 

0 

o - 

= 

'<*11 

<*12 

A<*13 


u 21 


U 22 

<*23 

0 

1 

0 


<*21 

<*22 

^<*23 


- U 31 


U Z 2 

<* 33 - 

.0 

0 

h. 


-<*31 

<*32 

^<* 33 - 


3-12. Equivalent Matrices. Two matrices are said to be equi¬ 
valent if one can be derived from the other by any finite number of 
operations of the types specified in §3* 11. It is easy to see that the 
relation of equivalence is of a reciprocal character, since the inverse 
of any one of the three elementary operations is another operation of 
the same type. 

Since any elementary operation upon a matrix u is equivalent to 
premultiplication or postmultiplication of u by a non-singular matrix, 
it follows that any matrix v which is equivalent to u must be related to 
u by the equation „ = Pu q . (1) 


where P and Q are non-singular matrices. It can in fact be proved^ 
that if v = PuQ, where P and Q are non-singular, then u and v are 
necessarily equivalent in accordance with the definition already given. 

It is important to note that, since the rank of a matrix is clearly 
unaltered by any of the elementary operations referred to in §3*11, 
equivalent matrices have the same rank. 


3-13. A Canonical Form for Square Matrices of Rank r. 

We shall now show that any square matrix u of rank r is equivalent to 
a canonical (or standard type) matrix C , whose elements are all zero 
with the exception of r units occupying the first r places in the prin¬ 
cipal diagonal. 

If the matrix is null, its rank is zero, and it is already in the canonical 
form. If it is not null, then bring a non-zero element to the top place 
in the principal diagonal by operations of type I (§ 3*11), and reduce 
this element to unity by an operation of type III. Next reduce all the 
other elements of the top row and of the first column to zero, by opera¬ 
tions of type II. If the elements lying below the first row are now not 


* See, for instance, Ref. 1. 
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all zero, bring a finite element to the second place in the principal 
diagonal: this can always be done without alteration of the first roff 
or column. The element in question can then be reduced to unity, and 
the remaining elements of the second row and second column reduced 
to zero, in the same way as before. It is evident that in this way u can 
always be reduced to a matrix C of the canonical form, with say t con¬ 
secutive units in the principal diagonal. Since C is equivalent to u , and 
thus has the same rank as u (see § 3*12), it follows that t = r. 

Examples 

(i) Non-Singular Matrix . In the trivial case where u is non-singular 
the canonical matrix C is simply I . Hence in equation (3-12-1) we may 
choose P = u * 1 and Q = I. 

(ii) Singular Matrix . Suppose 

u= f 1 2-3], 



which is of rank 2. Then by combined operations of type II 

i o o]r i 2 -3|[i oi = ri o o], 

110-1 2-1 0 1 f 002 

.0—1 lj L— 1 —3 4JL0 1 1J |0 0 -i 

and by further elementary operations 

“i o o] ri o o] ri o o] = ri o o’ . 

00100 20 01 010 
0 1 fj Lo 0 -fj [.0 -f OJ 10 0 0. 

These relations yield 

c=ri o oi = ri o o]r i 2-3]ri or. 

0 10 0-11-1 2 -1 0 -f 1 

0 0 OJ LI i ilL-1 —3 4 JL 0 -f 1 . 

3-14. Equivalent Lambda-Matrices, Two A-matrices are equi¬ 
valent when one can be derived from the other by means of the three 
elementary operations defined in §3-11. The elements of the matrix 
multipliers which correspond to the performance of an operation may 
depend on A, but the restriction is introduced that the determinants 
of these matrices must be independent of A, and as before they must not 
vanish. If gr(A) is equivalent to /(A), then 

^(A) = P/(A)Q, 


( 1 ) 
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where P and Q are, in general, A-matrices whose determinants are 
non-vanishing constants. Conversely, if g and / are related as in 
equation ( 1 ), and if | P | and | Q | are non-vanishing constants, then 
g and/are equivalent in accordance with the definition given. 

Clearly if g is equivalent to /, and h is equivalent to < 7 , then h is also 
equivalent to/: this is a transitive property of equivalence. 


3*15. Smith’s Canonical Form for Lambda-Matrices. A 

canonical diagonal form for A-matrices was established by Smith in 
accordance with the following theorem.* 

Any A-matrix/(A) of order n and rank r can be reduced to an equi¬ 
valent diagonal form 


E( A) EE 


'^i(A), 0 , 0 , ..., 0 ‘ 

0 , E 2 ( A),..., 0 , 0 

0 , 0 , ...,E r ( A ),...,0 

0 , 0 , 0 , ...,0 


.( 1 ) 


containing r isolated elements E p ( A) in the principal diagonal. The 
elements E p ( A) are such that 


E x (\) - S v 
E 2 (^) — 


Ep{^) — <^ 1^2 ••• 


( 2 ) 


where each S is a unit or a polynomial in A having unity for the co¬ 
efficient of its highest power. Moreover, this diagonal form is unique, 
and is the same for all matrices equivalent to the given A-matrix. 
The elements E p (X) are called the invariant factors of /(A). Each 
invariant factor is a factor of all those that follow it in the sequence. 

Let D p ( A) be the greatest common divisor! of the p-rowed minor 
determinants of/(A), written with unity as coefficient of the highest 
power of A, and adopt the convention that i) 0 (A) = 1 . Then it can be 


shown that the invariant factors are given by 

E p (\) - D p {X)jD p ^\). .(3) 

Suppose that E p (\) — (A—A 1 ) e pi(A-A 2 ) e p*.... .(4) 

♦ For proofs see Refs. 1 or 2. 


t The criterion of divisibility is absence of a remainder. A zero determinant is therefore 
divisible by any polynomial of A. 
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Then those factors (A — A ff ) e re which are not mere constants are called 
the elementary divisors of/(A), while (A —A x ), (A— # A 2 ), etc. are the 
linear factors . The invariant factors, elementary divisors, and linear 
factors are invariant: that is to say, they are the same for all matrices 
equivalent to /(A). 

Since E = PfQ , it follows that \E \ = | P 11 /11 Q |. But | P | and 
| Q | are independent of A. Hence the roots of A(A) = | /(A) | = 0 are the 
same as the roots of | E | = 0, and these are the numbers A x , A 2 , etc. 
which appear in (4). The total multiplicity of the root A s is 2}e pa . If 

p 

the roots are not repeated, it follows that E p ( A) = 1 when p + n, while 
E n (\) is A(A) divided by the coefficient of the highest power of A in 
A(A). 

Example 


Canonical Form for a Lambda-Matrix . It can readily be verified 
that if 


then 


m 


m = rA -a 

0 A*—A 
LA 8 -A 8 
= P(A)rA 0 

0 A(A— 1 ) 
L 0 0 


0 

A 3 -A 8 
A 4 —A. 

0 
0 

A(A a — 1)J 


Q( A), 


where P( A) = 

’I 0 0 ' 

0 10 

and Q(A) = 

1-1 O' 
0 1 A 


A 8 A 1 . 


0 0 1 . 


Here | P(A) | = 1 and | Q( A) | = 1 , so that both determinants are non- 
vanishing constants. Smith’s canonical form for /(A) is 


m = 

'A 0 

0 A(A-l) 

0 ' 
0 

= 

[^(A) 

0 

0 

-®*(A) 

o - 

0 


.0 0 

A(A 8 — 1 ). 


0 

0 

mi 


Note that here A(A) = A 3 (A— 1 ) 8 (A+ 1 ), that the 2 -rowed minor 
determinants of/(A) have A 8 (A — 1 ) as a common factor, and that the 
1 -rowed minor determinants have A as a common factor. Hence (see 
(3)), P 3 (A) = A 3 (A — 1 )(A 2 — 1 ); D a (A) = A 8 (A- 1 ); D t ( A) = A; and the 
invariant factors are 


P 3 (A) = A(A 2 - 1 ); P a (A) = A(A-l); E x ( A) = A. 
The linear factors are A, A — 1 and A+ 1 . 
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3*16. Collineatory Transformation of a Numerical Matrix 
to a Canonical Form. It has been shown in § 3*13 that an arbitrary 
numerical* matrix u is equivalent to a certain diagonal matrix (7, so 


that 


u = pCq> 


where p and q are not singular. When the matrices p and q in this 
equation are reciprocal, then u and C have the same latent roots (see 
§ 3-6, example (iv)). Moreover, if P(u) is any polynomial of u, then 


P(C)=p-'P(u)p, 


as illustrated by equation (3-6-12). The usefulness of this collineatory 
transformation is so obvious, and the ease of calculation with a 
diagonal matrix so great, that it is natural to consider whether such 
a transformation can always be effected. 

It has already been shown in §3-6 that the collineatory trans¬ 
formation is always possible when the latent roots of u are all distinct. 
However, the transformation is not always possible wh^n there are 
repeated latent roots. It is foundf that the general canonical form is a 
diagonal matrix with a certain number of unit elements added in the 
superdiagonal, t The rule is that the unit element is certainly absent 
if the place in the superdiagonal is adjacent horizontally and vertically 
to distinct elements in the principal diagonal, but that it may be 
present when the adjacent elements in the principal diagonal are the 
same. Thus the typical canonical form is 



* The adjective “numerical” is added in order to emphasise that a A-matrix is not 
intended. f For a full discussion of this question see Chap, vi of Ref. 2. 

X The elements immediately to the right of the principal diagonal are the superdiagonal 
elements. 
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where the elements in the principal diagonal are, of course, the latent 
roots. It is clear that the canonical matrix can be partitioned into 
a diagonal matrix of square submatrices whose rows and columns 
do not overlap, and which are all of the simple classical type 

1 . 
a 1 
a 1 
a_ 

The order of the simple classical submatrix can range from 1 (the 
elementary case) to n , the order of the matrix itself. 

The exact type of the canonical matrix is conveniently specified by 
means of its Segre characteristic. This consists of a set of integers which 
are the orders of the classical submatrices; those integers which corre¬ 
spond to submatrices containing the same latent root are bracketed 
together. For example, the Segre characteristic of 

"a 1 
a 

a 

fi i 
p i 
p 

7 

d 1 
d 

S_ 

is [(21) 31 (21)]. In the elementary case where the latent roots are all 
distinct the Segre characteristic consists of n unbracketed units. 

The essential point in the proof that (1) really is an irreducible form 
j consists in showing that a unit in the superdiagonal adjacent to a pair 
'.of equal elements in the principal diagonal cannot be removed by a 
collineatory transformation. This can be illustrated by a simple 
example. Suppose that it were required to transform 

A = [A 1] 

Lo Aj 

to the diagonal form by a collineatory transformation. Let 

® = f a 61. 
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so that i’-i = *- 1 [* c /]> 

where S = ad — bc^O. Then 

p-^Ap — S- 1 ^ d^a + c)- A 2 6c, d(\ 1 b-bd) — \ 2 bd "1. 

L-c(A 1 a + c) + A 2 ac, -c(A 1 6 + d) + A 2 adj 
If both the non-diagonal elements are to vanish, 

(A^ — A 2 ) bd + d ^ = 0 , 
and (A x - A 2 ) ac + c 2 = 0 . 

When A 1 4 =A 2 these equations can be satisfied without violation of 
the condition 5 + 0 , but when A x = A 2 the only solution is d = c = 0 , 
which is not permissible, for then 5 = 0 . 



CHAPTER IV 


MISCELLANEOUS NUMERICAL METHODS 

4*1. Range of the Subjects Treated. This chapter is divided 
into three sections. Part I deals with the computation of determinants, 
reciprocal matrices, and adjoint matrices, and with the numerical 
solution of systems of linear algebraic equations. Part II is concerned 
with the limiting forms of high powers of matrices and with the 
approximate calculation of latent roots. In this connection iterative 
methods are developed which are applied in Chapter x to the solution 
of dynamical problems. The approximate solution of algebraic equa¬ 
tions of general degree and the computation of Sturm’s functions and 
test functions for stability are briefly considered in Part III. 


Part I. Determinants, Reciprocal and Adjoint 
Matrices, and Systems of Linear Algebraic 
Equations 

4*2. Preliminary Remarks. Before describing the actual methods 
of computation we shall deal briefly with their underlying principles. 

A system of n linearly independent linear equations in n variables 
x may be represented by ax — h 

where a is a given non-singular square matrix of order n, and A is a 
column of n constants. The formal solution of the equations is 

x — a~ l h. 

If a - " 1 can be determined, the values of x corresponding to any set 
of constants h can be found immediately. Now a~ x = Aj\ a |, where A 
is the adjoint of a, and it is therefore possible to obtain a ~ 1 by a com¬ 
putation of the determinant | a | and of its first minors. Unless the 
matrix a is of low order this process is very laborious, and it is usually 
preferable to build up the reciprocal matrix in stages by elementary 
operations on the columns or the rows of a. 

One possible procedure is as follows. Postmultiply the equation 
ar x a = / by a non-singular matrix M 1 to give a~ 1 (aJf 1 ) = IM V The 
postmultiplication of a by M x represents some operation performed 
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on the columns of a , while the postmultiplication of I by M x represents 
the same operation on the columns of the unit matrix. A succession 
of 8 operations of this kind would give 

ar x <r = M 1 M 2 ..,M Si 

where cr = aM 1 M 2 ...M 8 . Hence if er is such that its reciprocal can be 
found easily, then a ”* 1 is calculable by the formula a -1 = M 1 M 2 ...M 8 <r~ 1 . 
Again, if the determinant | cr | can be evaluated easily, and if each of 
the multipliers M has unit determinant, then 

*1 = | <r|. 

An alternative is to apply to the equation oa -1 = / a succession 
of premultipliers, representing operations performed on rows. The 
choice of rows, or of columns, for the operations is in fact decided by 
convenience only. 

The derivation of suitable types of matrix cr and of the appropriate 
multipliers will now be considered. 

4-3. Triangular and Related Matrices. A matrix having only 
zero elements either to the right or to the left of its principal diagonal 
will be referred to as a triangular matrix. If the matrix is non-singular, 
none of its principal diagonal elements can be zero. Simple examples 
of non-singular triangular matrices are 


"2 

0 

O' 

t 

4 

5 

r 

3 

3 

0 


0 

3 

3 

4 

5 

1 „ 


0 

0 

2 . 


A non-singular matrix can always be reduced to a matrix of triangular 
form by operations on columns only, or on rows only, in an indefinitely 
large number of ways. One possible scheme of reduction, by operations 
on columns only, will be described. 

Firstly, choosing any convenient non-zero element of the top row,* 
say the ith, we reduce to zero all the other elements of the top row by 
subtracting suitable multiples of the ith column from the other 
columns in turn, and leave the ith column unchanged. Next, choosing 
any convenient non-zero element of the second row of the new matrix, 
say the jth (where j + i), and leaving both the ith and the jth columns 
unchanged, we can in the same way annul all the elements of the 
second row other than the ith and the jth. By a continuation of this 

* Each row of a non-singular matrix must contain at least one non-zero element. 
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4*3 


process we are ultimately led to a matrix which, if not immediately 
triangular, is a triangular matrix with its columns deranged. 

The operations described can conveniently be represented by a 
sequence of postmultiplications of the given matrix a. To avoid 
unwieldy general formulae the process will be illustrated with reference 
to a matrix of order four. 

Dealing firstly with the simplest case where the chosen element of 
the first row is a n (shown “starred” below), we have 


a n 

a i2 

a i3 

a i4 


"1 

— a 12 la n 

“ a i3l a ll 

— a 14 /o 11 

= 

a u 

0 

0 

0" 

a 21 

a 22 

a 23 

a 24 


0 

1 

0 

0 


a 21 

6ll 

^12 

^13 

a 31 

a 32 

a 33 

a 34 


0 

0 

1 

0 


a 31 

b 21 

^22 

^23 

- a 41 

a 42 

a 43 

a 44- 


_0 

0 

0 

1 


_ a 41 

^31 

^32 

^33_ 


( 1 ) 


where b {j are new elements which need not here be specified. The post¬ 
multiplier in this equation is the unit matrix / 4 augmented by a first 
row of elements derived from the first row of a in the manner shown. 
Next, if 6 n (shown starred below) is the chosen element of the new 
second row, 


"Oil 

0 

0 

0 " 


“1 

0 

0 

0 ~ 

= 

"Oil 

0 

0 

0“ 

o 2 i 

6* 

b 12 

^13 


0 

1 

— 6 12 /6h 

~~ ^ 13/^11 


®21 

bn 

0 

0 

o 3 l 

^21 

b 22 

^23 


0 

0 

1 

0 


a 31 

b 21 

c n 

C 12 

-«41 

^31 

ca 

eo 

•O 

^33_ 


_0 

0 

0 

1 


_°41 

b^i 

C 21 

C 22_ 


( 2 ) 


where are new elements. Lastly, if c n (shown starred below) is the 
chosen element of the third row, 


a il 

0 

0 

0~ 


“1 

0 

0 

0 " 

= 

'Oil 

0 

0 

0 ‘ 

a 2l 

bn 

0 

0 


0 

1 

0 

0 


°21 

bn 

0 

0 

a 31 

b 21 

c n 

C 12 


0 

0 

1 

— c 12 /c u 


®31 

b 21 

c n 

0 

- a 41 

bzi 

C 21 

C 22- 


_0 

0 

0 

1 


.041 

bsi 

C 21 

^ii_ 


(3) 


Denoting the postmultipliers in equations (1), (2), (3), respectively, 
by M v M 2i M 3 , and the final triangular matrix by r, we obtain 

a,M 1 M 2 M z — r. 

Since each of the multipliers M has unit determinant, it follows that 
| a | = | t | = ^n6nC n d n . 
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It is easy to show that the product M 1 M 2 M 3 is itself a triangular matrix. 
Thus 

M -± M 2 — 1 ^12/^11 ^13/^11 ^14/^11 10 0 0 

01 0 0 0 1 -6 12 /6 U -6 13 /6 u 

00 1 0001 0 
_0 0 0 1 J1_0 0 0 1 

= 1 — ®i2,/ a n a l a 2 » 

0 1 — ^12/^11 ~&13 fin 

0 0 1 0 

0 0 0 1 

where a v ac 2 are new elements. Hence 

M x M 2 M 3 = T1 — a 12 /ct 11 a, a 2 "lTl 0 0 0 

0 1 “^12/^11 — ^13/^11 010 0 

0 0 1 0 0 0 1 — c 12 lc u 

0 0 0 1 J|_0 0 0 1 

= ri -a 12 /a u a x A ]> 

® 1 ~ ^12/^11 

0 0 1 ““^12/^11 

0 0 0 1 

in which fi v fi 2 are new elements. The product M 1 M 2 M 3 in this case is 
triangular and “opposite-handed” to the triangular matrix r. 

The particular method of reduction given would fail if any one of the 
starred elements were zero. A more general treatment is possible in 
which the rows are taken in any order and the starred elements are 
not necessarily in the principal diagonal. In this case the final matrix 
will, in general, not be triangular, but will be a triangular matrix with 
its rows in some way interchanged, and possibly with its columns 
also in some way interchanged. To illustrate the rule for the con¬ 
struction of the multipliers, suppose a Z2 to be the first starred element ; 
then 

®11 ®12 ® I 3 ®14 1 0 0 0 = ^12 ^13 * 

®21 ^22 ®23 ®24 ~^ 3l/ tt 32 ^ — ^ 33/^32 “^ 34/^32 ^21 ^22 ^22 ^23 

®31 ^32 &34 0 0 1 0 0 0 0 

JL 41 ®42 ®43 ^44_ 0 0 0 1 _^31 ®42 ^32 ^33_ 
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The multiplier is here constructed similarly to that in (1), except that 
its second row now contains the elements derived from the third row 
of a. More generally, if the starred element in a lies in the tth row and 
jth column, the elements derived from the ith row of a are entered in 
the jth row of the multiplier. The process of reduction will be clear 
from the numerical examples which follow. 


Examples 

(i) Rows and Columns taken in Consecutive Order. If 

a = T1 4 1 31 , 

0-1 3-1 

3 10 2 

.1 -2 5 1. 

then the scheme of reduction, with the starred elements in the prin¬ 
cipal diagonal, is 

'1* 4 1 31 p -4 -1 -31 = p 0 0 01, 

0 -1 3 -1 0 1 0 0 0 -1 3 -1 

3 1 0 2 0 0 1 0 3 -11 -3 -7 

.1 -2 5 1J Lo 0 0 lJ Ll -6 4 - 2 . 

"l o o oi ri o o oi = ri o oi, 

0-1* 3-1 013-1 0-1 00 

3 -11 -3 -7 0 0 1 0 3 -11 -36 4 

.1-6 4 -2J LO 0 0 lJ Ll -6 -14 4. 


1 0 0 0 
0-1 0 0 
3 -11 -36* 4 
.1 -6 -14 4 


10 0 
0-1 0 
3 -11 -36 
1 -6 -14 


Hence | a | = 1 x (— 1) x (— 36) x (^) = 88. 

The product of the multipliers works out as 


m x m z m 3 



(ii) Rows, but not Columns, taken in Consecutive Order. With a as 
for example (i), another possible scheme of reduction is 
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’1 4 1* 

3“ 


m 


10 0 

O' 

= 

■ 0 

0 

i 

0 




0-13- 

-1 




0 1 0 

0 


-3 

-13 

3 

-10 



3 10 

2 



- 

1-4 1 

-3 


3 

1 

0 

2 




.1 -2 5 

1 . 


- 

0 0 0 

1 . 


.-4 

-22 

5 

-14 

J 


0 

0 1 

0 “ 


“1 

0 - 

¥1 

= 

■ 0 

0 

1 

0 " 

> 

- 

3* —13 3 - 

10 


0 

1 0 

0 


-3 

0 

3 

0 



3 

1 0 

2 


0 

0 1 

0 


3 

-12 

0 

-8 


_- 

4 

-22 6 - 

14. 


.0 

0 0 

1 - 


.-4 

-¥ 

5 

-l-l 



r 

0 0 1 


0 

n 


’10 0 

0 ' 

= 

• 0 

0 

1 

O' 




-3 0 3 


0 



0 10 

0 


-3 

0 

3 

0 




3-12 0 

— 

8 * 


0 0 1 

0 


3 

0 

0 

-8 



- 

-4 5 

- 

2 

X 

J 


-0 -f o 

1 . 


.-4 

-¥ 

5 

-|J 



By three interchanges of its columns the final matrix can be brought 
to the triangular matrix 

'1 0 0 O' 

3-3 0 0 

0 3-8 0 

-5 -4 -§ -V- 

Hence | a | = lx(-3)x(-8)x(-^)x(-l ) 3 = 88 , as before. The 
product of the multipliers in the present case is 


M x M 2 M z = 


0 1 
0 0 
1 -1 
L 0 0 


¥ §1 

0 1 

i 

1 


This matrix has unit determinant. It can be represented as a triangular 
matrix by a rearrangement of its rows, but not by a rearrangement 
of its columns. 


(iii) Columns, but not Rows, taken in Consecutive Order. With a 
again as for example (i), an illustrative sequence of operations is 


n 

4 

1 

3' 


'1 

2 

-5 

- r 

= 

'1 

6 

-4 

2 ' 

0 -1 

3 

-1 


0 

1 

0 

0 


0 

-1 

3 

-1 

3 

1 

0 

2 


0 

0 

1 

0 


3 

7 

-15 

-1 

Ll* —2 

5 

1 . 


.0 

0 

0 

1 - 


.1 

0 

0 

0 . 
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"1 


6 

-4 2' 


r 

1 

0 

0 

0 " 

= 

“1 

6 

14 

-4" 

J 

0 

— 

1 * 

CO 

1 

h* 


0 

1 

3 - 

1 


0 

-1 

0 

0 


3 


7 - 

15 -1 


0 

0 

1 

0 


3 

7 

6 

-8 


.1 


0 

—i 

o 

o 


L 

0 

0 

0 

1 - 


.1 

0 

0 

0 - 



“1 

6 

14*-4 

1 


"1 

0 

0 

0 " 

= 

‘1 

6 

14 

0 

1 


0 

-1 

0 0 

• 


0 

1 

0 

0 


0 

-1 

0 

0 



3 

7 

00 

1 

CO 


0 

0 

1 

2 

7 


3 

7 

6 

-¥ 


-1 

0 

0 OJ 


-0 

0 

0 

1 - 


.1 

0 

0 

0 

J 


If the rows of the final matrix are taken in the order 4, 2, 1, 3— 
corresponding to the row order chosen for the starred elements—the 
result is the triangular matrix 

"1 0 0 0 " . 

0-10 0 
1 6 14 0 

-3 7 6 -±±- 

From this it is seen that | a | = lx( — 1) x 14 x (— x (— l) 4 = 88. 


4-4. Reduction of Triangular and Related Matrices to 
Diagonal Form. The reduction of a triangular matrix to the diagonal 
form is a simple matter. If the given non-singular matrix is 


I-Tu 

0 

0 

0 1 

> .(i) 

r 21 

T 22 

0 

0 


r 31 

T 32 

T 33 

0 


-^41 

^42 

T 43 

T 44- 



then by operations on columns only 


>u 

0 

0 

0 ' 


" 1 

0 

0 

0" 

= 

>11 

0 

0 

0 ■ 

*21 

T 22 

0 

0 


0 

1 

0 

0 


T 21 

T 22 

0 

0 

**31 

T 32 

T 33 

0 


0 

0 

1 

0 


T 31 

T 32 

T 33 

0 

**41 

T 42 

^43 

7*4 


_Ui 

7 44 

T 42 

T 44 

J[± 3 

T 44 

1 


0 

0 

0 

-1 


The result of the postmultiplication is that all the elements of the last 
row with the exception of r 44 are annulled, but the matrix is otherwise 
unaltered. Clearly, by a succession of such operations r is reducible to 
the diagonal form 


7 

0 

0 

L 0 


u 


0 

T 22 

0 

0 


0 

0 

T 33 


0 l 

0 

0 

T 44J 


•( 2 ) 
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If the given matrix is a triangular matrix with its rows interchanged, 
a similar method may be adopted. For instance, if 


-Oil 

<7 'l2 

0*13 

o - 

&21 

0*22 

0 

0 

°31 

<r 32 

^33 

° r 34 

.0-41 

0 

0 

0 _ 


then by three successive postmultiplications this matrix is reducible to 


- 

‘ 0 

0 

^13 

0 " 


0 

0*22 

0 

0 


0 

0 

0 

^34 


-0*41 

0 

0 

0 - 


4*5. Reciprocals of Triangular and Related Matrices. The 

reciprocal of the triangular matrix r (see (4-4-1)) can be obtained as 
follows. Consider the system of linear equations 


Vi 

— h v 

Toi 

z^2/i+ Vi 

T 11 

= h 2 , 

Toi Ton 

f : vi+^y i + y 3 

T 11 T 22 

II 

^yi+~y i +~ y 3 +y* 

T 11 T 22 T 33 

— ^ 4 , 


in which h v h 2 , h Z) & 4 are arbitrary. These equations are equivalent to 
tS~ x y = h (where S is the diagonal matrix (4-4-2)) and their solution 
can be written y = 

where a)* 1 — rS~ l or r -1 = Hence r _1 can be obtained by multi¬ 
plication of the successive rows of co by 1 /r n , l/r 22 , 1 /r 33 , 1 /t 44 , re¬ 
spectively. 

Now the elements in the first column of oj are the values of y when 
h = {1,0,0,0}: similarly, the elements in the second column are the 
values of y when h = {0,1,0,0}; and so on. But when h = {1,0,0,0} 
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This suggests the following scheme of computation. The scalar multi¬ 
pliers by which the rows of o) have to be multiplied to give r- 1 are 
written on the extreme right. 

(l/Tll) 

(l/r«) 

U/r M ) 

(l/r«) 

Ruled: (i) To form the left-hand array (X) enter blanks in, and to the 
right of, the principal diagonal. Derive the remaining elements from 
r as shown. 

(ii) Commence the right-hand array ( Y) by entering units in the 
principal diagonal, and ciphers to the right of that diagonal. 

(iii) Calculate the remaining elements of a) in succession by the 
following method. To obtain a) {j postmultiply the row of ( X ) level with 
( 0 i5 by the part-column of ( Y) standing above Blank elements of 
(X) are to be disregarded. 

(iv) To find r -1 , multiply the rows of o) respectively by the scalar 
factors on the extreme right. 

Column No. 1 of (F), for instance, is completed as below: 



A similar procedure is possible if the given matrix, say cr, is derived 
from a triangular matrix r by interchange of its rows, but the con¬ 
struction of the arrays (X) and (F) in this case requires care. The 
simplest and safest method is to convert cr first to the triangular form 
by actual interchange of the rows. Suppose for example 







4-5 RECIPROCALS OF TRIANGULAR MATRICES 105 

The stars indicate the last non-zero elements of the rows. If the rows 
are taken in the order 4, 2, 1, 3 to bring the starred elements into the 
principal diagonal, the resulting matrix is triangular. This process can 
be represented by 


'0 

0 

0 

1 ” 


‘°u 

a 12 

0?8 

0 ~ 

= 

'<r*i 

0 

0 

0 “ 

0 

1 

0 

0 


^21 

°22 

0 

0 


°*1 

<r*2 

0 

0 

1 

0 

0 

0 


O 31 

°32 




O'11 

er 12 

°*13 

0 

.0 

0 

1 

0 _ 



0 

0 

0 _ 


-O'ai 

Cr 32 

°*33 



For brevity denote this as Na — r . . (1) 

Then the reciprocal of o' is given by 

or- 1 = t~ x N. .(2) 


The appropriate multiplier N in any given case is found by the following 
rule: N is the transposed of or with its starred elements replaced by 
units and all other elements replaced by ciphers. 

If o* is derived from a triangular matrix r by interchange of its 
columns, then the equations corresponding to (1) and (2) are crM = r 
and o'” 1 = Mt" 1 . In this case the first non-zero elements of the 
columns of or are starred, and the multiplier M is then derived as before. 


Examples 


(i) Reciprocal of Triangular Matrix. Suppose (see example (i) of § 4*3) 


0-1 0 0 
3 -11 -36 0 

.1 -6-14 


The first step is to prepare the following scheme in accordance with 
rules (i) and (ii). 


(1) 
(-D 
(~ A) 
(A) 


X Y (matrix w) 


— — — — 

10 0 0 

0 — — — 

10 0 

-3 -II — — 

1 0 

1 

i 

CO 

1 

1 

1 


To complete the first column of oj, we calculate in succession 


< 0*1 = [ 0 ]{ 1 } = 0 , 

<w 8 i = [—3, —11]{1, 0} = —3, 
w» = [-1.-6,-&]{!, 0,-3} = *. 
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Similarly, g> 32 = [ — 3, — 11]{0, 1} = — 11, and so on. The results are 
entered in ( Y) as they are obtained, and give 


Hence 


(O ' 


1 0 0 
0 10 
-3 -11 1 

L i -tt -A 



n H m 


0 “ 

0 

0 

1 _ 

0 “ 

0 

0 


, 3 

L_4* 


31 


7 9 

"4¥ W2. 


(ii) Reciprocal of Triangular Matrix with Rows Interchanged . 
Suppose cr = f 3 —11 —36* 0 


0 - 1 * 0 0 

1 -6-14 

1 * 0 0 0 


Taking the rows in the order 4, 2, 1 , 3 to bring the starred elements 
into the principal diagonal, we have the triangular matrix already 
considered in example (i). The appropriate multiplier N is here 

’0 0 0 1 '. 

0 10 0 
10 0 0 
.0 0 10 . 

Hence, by the result of example (i). 


II 

r -4 

1 

b 

" 1 

0 

0 

0 “ 


"0 

0 

0 

1 " 


0 - 

-1 

0 

0 


0 

1 

0 

0 


i 

T2 

li - 

1 

YS 

0 


1 

0 

0 

0 


3 

l _ 4 * "" 

31 

Ti ~ 

7 

Ti 

9 

TSJ 


_0 

0 

1 

0 _ 

= 

‘ 0 

0 

0 

1 

■ 

. 





0 

-1 

0 

0 








tt 

0 

rz 






A 

-tt 

9 

TZ 







4*6. Computation of Determinants. The abridged schemes 
of computation of determinants now to be described are based on the 
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theorems given in §§ 4*2 and 4*3. The treatment is applicable generally, 
but for brevity the given determinant will be assumed to be of order 4. 
Thus A = I a I, where 


= 

~ a ll 

a i2 

«13 

«14~ 


a 21 

a 22 

a 23 

a 24 


a zx 

a 32 

a 33 

a 34 


_ a 41 

a 42 

a 43 

a 44_ 


The process consists effectively in the reduction of the matrix a to a 
triangular form. Many methods of computation of determinants are 
of course based on the same principle, but the reductions will here be 
expressed by means of matrices. 

A convenient abridged scheme of computation, which corresponds 
to the sequence of postmultiplications represented by equations 
(4*3*1, 4*3*2, 4*3*3), is as follows: 


(A) 


o u * 

a ia 

a i3 

a i4 

a n 

a ll 

g 14 

a ll 

®21 

a 22 

®23 

«24 

1 

0 

0 

®31 

®32 

a 33 

°84 

0 

1 

0 

a xi 

«42 

®43 

°44 

0 

0 

1 


bn* 

bn 

^13 

bn 

bn 

bn 


(B) 

bn 

b n 

b 2 s 

1 

0 

(B') 


bn 

b» 

bn 

0 

1 


(C) 


<ai* 


_£l2 

Cll 

<C') 





C 22 

1 




(D) 


dn* 





(AO 


The value of A = | a | is given by the product a n 6 u c n d 11 . 

The top left-hand array (A) is the array of the elements in the given 
determinant A. The leading element a n (shown “starred”) is divided 
into the remaining first row elements, and the results with their signs 
changed form the first row of the top right-hand array (A'); this array 
is completed by the elements of the unit matrix I 3 . To obtain the array 
(B), the starred row is omitted from (A), and the remaining rows are 
postmultiplied by (A'), in accordance with (4*3*1). The array (C) is 
derived from (BV and (D) from (Cj, in a similar manner. 

In the preceding arrangement the starred element on the left is in 
each case the leading element of the corresponding array. However, 
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4 - 6-47 


by slightly generalising the scheme, we can remove this restriction and 
star any convenient element in any left-hand array. The rule in this 
connection is as follows: 

Ride: If the chosen starred element in any left-hand array (say B) 
lies in the ith row andjth column of that array, then the corresponding 
right-hand array (B') is constructed from the ith row of (B) and from 
a unit matrix as previously, except that the elements derived from (B) 
must be.entered as the jth row of (B'). The value of A is then the product 
of the starred elements multiplied by the factor (-1)^ where N is 
the total number of interchanges required to bring the starred elements 
into the leading places of their arrays. 

Example 

Evaluate the determinant 


559-2 

0 

24-0 

0 

174 

0 

601-3 

0 

117-2 

0 

0-21 

1-899 

-1 

1-862 

0-21 

1-258 

0-005 

0-669 

-1 

0-580 

0-008 

0-832 

0-008 

0-670 

-1 


669-2 

0 

24-0 

0 

174 

1 

0 

0 

0 

0 

601-3 

0 

117-2 

0 

0 

1 

0 

0 

0-21 

1-899 

-1* 

1-862 

0-21 

0-21 

1-899 

1-862 

0-21 

1-268 

0-006 

0-669 

-1 

0-580 

0 

0 

1 

0 

0-008 

0-832 

0-008 

0-670 

-1 

0 

0 

0 

1 


664-24 

46-676 

44-688 

17904 

1 

0 

0 



0 

601-3 

117-2 

0 

0 

1 

0 



1-39849 

1-27543 

0-246678 

0-72049 

0 

0 

1 



0-00968 

0-847192 

0-684896 

- 0-99832* 

0 009696 

0-848618 0-686049 




665-976 

197-513 

167-518 

1 

0 





0 

601-3 

117-2* 

0 

1 





1-40648 

1-88685 

0-739969 

0 

-5-13065 






565-976 

-661-946 

1-35868 







1-40648* 

-1-90960 

1 




TTat 

l/to 



107034* 






A = (-l)x(- 0-99832) x 117-2 x 1-40548 x 107-034 x (- l) u 
= 17601-3. 

4-7. Computation of Reciprocal Matrices. Three different 
methods will be described. They will be dealt with under separate 
headings, and will be referred to as (i) the method of postmultipliers, 
(ii) the method of submatrices, and (iii) the method of direct operation 
on rows. 
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In the first two methods definite rules for procedure must be learnt. 
In the third the operations are extremely simple, and are left to a large 
extent to the judgment of the computer. 


4*8. Reciprocation by the Method of Postmultipliers. It has 

been shown in §4-3 that by the application of ft — 1 successive post¬ 
multipliers, M v M 2 , ..., M n __ v each of unit determinant, a given non¬ 
singular matrix a of order n can be reduced to a matrix <r, which if not 
immediately triangular is a triangular matrix with its rows inter¬ 
changed. The reciprocal of cr can be found readily by the methods of 
§ 4-5, and then 


T-l - 


= (M l M i ...M n _ 1 ) cr-K 

The construction of cr and of the product M n _ 1 can be effected 

simultaneously by a slight extension of the scheme already described 
in §4-6 for the computation of determinants. To avoid unnecessary 
complication the starred elements will be restricted to lie in the first 
columns of their arrays, although they need not necessarily lie in the 
first rows. A representative abridged scheme of computation, appro¬ 
priate to a matrix of order 4, is as follows. 


(A) 


Oil 


<hs 


<h* 

«ia 

0|4 

°14 



— 

“ii 

°8l 

°31 

* 

«** 

°*a 

°24 

1 

0 

° ( 

ati 

<h% 


«B4 

0 

1 

0 

°41 


*43 

«44 

0 

0 

1 


— 

— 

— 


a* 


(B) 

b a 

At 

At 

b» 

~b n 

^ss 

~b n 

(»') 

K 

b„ 

A. 

1 

0 



bi 

bm 

At 

0 

1 





— 

A 






— 

A 



(C) 


* 

Sll 

C 11 


(O') 


Cn 

C 11 





Cfl 


1 




(D) 





(AO 


Rules : (i) Proceed as for the computation of determinants (see 
§4*6), but introduce blank upper rows on the left as necessary, so as 
to preserve a constant number of rows. Temporarily leave empty 
rows on the right opposite the blank rows on the left. 
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(ii) After calculation of d n fill the empty rows on the right in 
succession. To find \ot v a 2 ] premultiply the two part-columns of (B') 
already determined by the top row of (A'). Similarly, to find {J3 V /? 2 } 
premultiply the part-column of (C') already determined by the two 
top rows of (B').t 

(iii) To form the matrix or, set down in succession the first columns 
of the left-hand arrays (disregarding places filled by blanks), and 
always write ciphers to the right of each starred element. 

(iv) To form the product (M 1 M 2 M 3 ), write down the successive 
first columns of the right-hand arrays, and precede them by the column 
{ 1 , 0 , 0 , 0 }. 

In the particular case taken the matrices cr and M X M 2 M 3 formed in 
accordance with the rules given would be 


"«11 

*11 

r* 

c n 

0 ‘ 

; M 1 M 2 M 3 — 

1 

“ a 32l a 31 

a l 

Pi ■ 

*21 

^21 

C 21 

<*fi 


0 

1 

~ ^32/^31 

P2 

®31 

0 

0 

0 


0 

0 

1 

- C 12/ C ll 

-«41 

^31 

0 

0 


_0 

0 

0 

1 


Moreover, | a | = « 3 i 6 31 c 11 d 11 ( — l) 4 . 


The reciprocal cr -1 can now be found by the methods given in §4-5, 
and then a -i = (M^MJ <r~\ 


If required the adjoint of a can also be deduced by use of the relation 
A = a -1 1 a |. It may be noted that, since or- 1 necessarily contains a 
column proportional to { 0 , 0 , 0 , 1 }, one column of a -1 —and therefore 
of the adjoint A —will be proportional to the last column of M 1 M 2 M 3 . 
Accordingly, the adjoint has a column proportional to the last right- 
hand array of the abridged scheme. 


Suppose 


Example 


0-1 3-1 

3 10 2 
.1 -2 5 1 - 


More generally, to complete any right-hand array (K') t premultiply the submatrix 
of K ' already determined by the appropriate submatrix taken from the top of the com¬ 
pleted preceding right-hand array. 
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The abridged scheme which follows 
calculations given in extenso in examp! 

14 13 

/ A \ 0 - 1 3-1 

w 3 1 0 2 

1* -2 5 1 

should be compared with the 
le (iii) of § 4 - 3 . 

2 -5 -1 

1 0 0 (A'\ 

0 1 0 > 

0 0 1 

(B) 6-4 2 

-1* 3 -1 

7 -15 -1 

1 -3 

3 -1 (B') 

10 

0 1 

< c) t 4 . : 4 

6 -8 

-V 1 

-f 

------ (C') 

f 

1 

(D) -V* 



Hence 


'1 

6 

14 * 

0 " 

and M 1 M 2 M 2 = 

“1 

2 

1 

19-1 

“7 

0 

-1* 

0 

0 


0 

1 

3 

1 

-y 

3 

7 

6 

44 * 

7~ 


0 

0 

1 

2 

7 

_1* 

0 

0 

0 


_0 

0 

0 

1 ^ 


Also | a | = lx(—l)xl 4 x(-^)x(-l)‘ = 88. 

Note that the array (B') is completed by 


[2 -5 




= [ 1 .- 3 ], 


and that (C') is then completed by 




To find (T -1 the method already illustrated in example (ii) of § 4-5 can 
be used. Thus 


'0 

0 

0 

1 " 

and 

Ncr — t = 

'i* 

0 

0 

0 ~ 

0 

1 

0 

0 



0 

-1* 

0 

0 

1 

0 

0 

0 



1 

6 

14 * 

0 

_0 

0 

1 

0 



_3 

7 

6 

44* 

7 
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The scheme for the computation of «is here 


4 * 8 - 4*9 


Hence 



1 

0 

0 

0 

(1) 

0 

1 

0 

0 

(-1) 

-1 

6 

1 

0 

(A) 

-V 

V 


1 

<-*> 


Y (matrix <u) 


A ~AJ 

Finally, a -1 = (M 1 M 2 M 3 ) <x~ x 


0 

1 

0 

0 


0 

0 

0 

1 


= 

'1 

2 

1 



'0 

0 

0 

1" 


0 

i 

3 

-* 


0 

-1 

0 

0 


0 

0 

1 

f 


A 

f 

0 

“A 


.0 

0 

0 

i . 


-A 

31 

I X 


A. 




-5 

9 

4 


15 19 -8' 

17 1-12 

10 -2 2 


3 -31 -7 18 


If required, the adjoint of a can now be deduced by use of the relation 
A = o _1 |o|, and the known value |o| = 88. One column of the 
adjoint is proportional to {19, 1, —2, —7}, or to column (O') in the 
abridged scheme. 


4*9. Reciprocation by the Method of Submatrices. This is an 
independent method based on the properties of partitioned matrices. 

Suppose a to denote a non-singular Bquare matrix of order m. Then 
a can be partitioned as below into four submatrices: 

a = roller), a 12 (f,8)1. 

Lan&r), “a(«. »)1 

The orders of the submatrices are as indicated, and r+8 = m. 

Let the reciprocal J3=a~ x be correspondingly partitioned; thus 


fi- 


Ai(«»> /?«(«>«) J 
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Then, since floe = J m , we have the four relations 

fin a ii+fii 2 a 2 i ~ It* 

Ai a i*+ flit°hz = 

Ai a n+^ 22*21 = 

Al a 12 +^22 a 22 ~ ^8* 

These equations may now (in general) be solved to give the submatrices 
of fl explicitly. The results may be expressed as follows. Let 

= G'll‘3'12 > ^ = ^Oii 1 ; 0 = a 22 ^^12 ~ a 22 

Then fl 11 -oe^ 1 + X0^Y 9 

A 12 = -X0~\ 
fl n = -0-'Y f 
fin -o- 1 . 

These formulae serve to determine fl provided the reciprocals a^ 1 and 
0- 1 exist. If afi 1 is known, the quantities X , Y, 0 can be calculated, 
and fl can then be deduced. An arrangement of the numerical work 
which is convenient and self-explanatory is suggested below: 



«11 

«n 

X — OCiVoCm 

*ii l 


5- 1 

T = otjj^au 1 



or 1 = 

i 

i 

1 - 

N 

I 


1 

1 

i 

0- 1 J 


In the simplest application r = m — 1 and 5=1: then a 21 is a single row, 
a 12 is a single column, a 22 is a single element, and 0 is a scalar. More¬ 
over, since in this case 0- 1 is the last diagonal element of ar\ namely 
I a n l/l a l> we have the relation 

6 =i a i/i *u i- 


(i) 
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The general procedure in dealing with a given numerical matrix 
will now be explained. Suppose the matrix to be 


a = 

°n 

#12 ••• 

a l n 


a 21 

#22 ••• 

a 2 » 


~ a wl 

a n2 ••• 



Then a sequence of matrices S v S 2 , S 3 , etc., progressively including 
more and more of the principal diagonal elements of #, can be formed 
as follows: 


Si — [#n], 


» 

III 

a U a i2l » 




.®21 ^22J 



s* 

III 

#H # 12 #13 

= 

« >1 


a 21 a 22 a 28 


J «28 


#31 #32 #33_ 


-®81 a 32 ! a 33- 


III 

"°U 

a 12 

a l3 

1 

s 

= 



1 

1 

a u 


«21 

a 22 

a 23 

a 24 



S 3 

1 

1 

«24 


°31 

a 32 

a 33 

#34 




1 

1 

°34 


- a il 

a 42 

a 43 

a 44_ 


- a a 

#42 

«43 ! 

«44. 


and so on. 

Commencing with the second of these, we can write down the 
reciprocal of S 2 immediately; thus 

£-1 =-i-[ ”°u|. 

a U a 22 ” a l2 a 2l L ~ a 21 a UJ 

Using the value of S 2 \ we can now apply to S 3 the scheme of computa¬ 
tion just described, and so obtain S j 1 . Next, using S 3 l , and applying 
the scheme to $ 4 , we derive Sf 1 . Proceeding in this way, we finally 
obtain S~ l or or 1 . 

In the exceptional case where a singular matrix, say S if is en¬ 
countered (as indicated by 0 = 0), the simplest procedure is to transfer 
the ith row of # to last place, and then to continue as normally. This 
rearrangement of the rows will be equivalent to premultiplication of a 
by N, where N is the unit matrix with its ith row transferred to last 
place. The final reciprocal matrix as computed will then have to be 
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postmultiplied by N to give a * -1 (compare (4-5-2)). More generally, 
if several rows of a are transferred in the foregoing way during 
the computations, the appropriate multiplier N can readily be con¬ 
structed. 


If 0 f denotes the value of 6 obtained in the construction of S^ 1 , 
then by (1) , 


Hence 


I a I “ ®n®n -1 -*• ^2^1- 


Examples 

(i) Normal Case (Matrices 8 all Non-singular). Suppose the given 
matrix to be (see example, §4-8) 

a = ri 4 1 31. 

0-1 3—1 

3 10 2 

-1 -2 5 1. 

Then S 2 = p 4 j = S 2 K 

The scheme for the computation of S 3 1 is as follows: 

; xe-*Y = r-H -wi. 

L i HJ 

6 
Y 



Y 
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Hence (compare result of example in § 4-8) 




r "& 


if if 

Ti il Ti 
Tf if ~_TS 

L A -a-A 
= A| 


-A 

■A 

A 


-6 16 
9 17 

4 10 

L 3 -31 
The determinant | a | is here given by 

0 4 0 8 |S a | = ^x(-36)x(-l) = 88. 
(ii) Case of Singular Matrix S. Suppose 


T5J 
19 -8 

1 -12 
2 

18J 


-2 

-7 


a = 


Here 


& = 


1 1 
1 2 
1 0 
L3 -1 
and 


2 1 
4 3 
0 2 
1 5J 


[i a “ d -;]■ 

Proceeding to $ 3 , we find that 0 = 0: thus 


1 

0 

0 

2 

-1 

2 

-1 

1 

4 

2 

-1 

<35 

II 

O 


This result indicates that S 3 is singular. Transferring the third row of 
a to last place, and continuing as normally, we have 


X 

e- 1 



3 -1 

1 

0 

2 -1 

2 

2 

-1 1 

4 

i 

7 -4 

3 


xe~ i T = 


t; .a- 


Lastly, 


e~ l 



1 0 0 

2 

-1 

2-10 

1 

-¥ 

i 

1 

«*• 

3 

¥ 

-111 

6 

* 

2-10 

3 


xe~ i r = 


r-f i oi. 
-¥ v o 

V- -V 0J 
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I -I o ilp o' 

I -i -I ^ 0 10 0 

-i I I -¥ oooi 
■t t o iJLo o 1 o. 

12 -6 3 0 '. 

5 -1 14 -6 

-1 2-10 3 

-6 3 3 0_ 

(iii) Specimen Calculation for Sixth Order Matrix. In the preceding 
examples the successive steps have been explained in some detail. 
To illustrate the compactness of the method, we shall now compute 
the reciprocal of a symmetrical sixth order matrix, omitting all un¬ 
essential steps. The given matrix is 

a=|~ 10-472 0-506 0 -3-935 -0-521 0 

0-506 11-016 5-000 -0-521 -1-046 3-750 

0 5-000 26-000 0 0 -1-050 

-3-935 -0-521 0 6-322 0-536 0-355 

-0-521 -1 046 0 0-536 2-737 0 

0 3-750 -1-050 0-355 0 3-881 

and the computations begin with S^ 1 bordered (see next page). 

To illustrate the accuracy, the product of the given matrix a and 
the computed reciprocal is given below: 

aa~ x = 

0-999995, 0-000000, 0-000000, 0-000003, 0-000000, 0-000000 

0-000000, 1-000003, -0-000001, 0-000001, 0-000001, -0-000002 

0-000000, 0-000003, 0-999999, 0-000001, 0-000002, -0-000006 

0-000002, 0-000000, 0-000000, 0-999996, 0-000000, 0-000000 

0 - 000000 , 0 - 000000 , 0 - 000000 , 0 - 000000 , 1 - 000000 , 0-000000 

0-000000, 0-000001, 0-000000, 0-000000, 0-000000, 0-999998 
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0 

6*000 

26000 

-0*0219802 

0*0967062, 

-0*00439604 

0 

0*464895 

-0*00439604, 

0*0909790 

6*000 

0*0421487 

-0*0219802, 

0*464895 

23-7255 



-3*935 

-0*521 

0 

6*322 

-0*374170 

00957266, 

— 0*00481747' 

0 000926437 

-3*935 

-0*0329874 

-0 00481747, 

0*0997008 | 

-0*0191732 

-0*521 

0 00634371 

0 000926437, 

-0*0191732 | 

0 0421487 

0 

0*206934 

-0*374170, 

-0*0329874, 

0 00634371 

4*83245 



-0*521 

-1046 0 

0*536 

2*737 

-0*0210980 

0*124697, 

-0 00226331, 0*0004352531 

0*0774285 

-0*521 

-0*0996846 

-0*00226331, 

0*0999260, -0*0192165 | 

0*00682621 

-1046 

0*0191701 

0*000435253, 

-0*0192165, 0*0421570 

-0 00131273. 

0 

0*0634362 

0*0774285, 

0*00682621,-0*00131273 j 

0*206934 

0*536 

0*386438 

-0*0210980, 

-0*0996846, 0*0191701, 

0*0634362 

2*68774 



0 3*750 -1*050 0*355 0 

3-881 

0*0215710 

0*411631 

-0*119644 

0 0913330 

0*143533 

0*124869, -0*00145057, 0 000278958, 0*0769113 ! 0*00815307 

-0*00145057, 0*103766, -0*0199550, 0*00438253 ! 0*0385219 

0*000278958, - 0*0199550, 0 0422990, - 0*000842790; - 0*00740806 

0*0769113, 0*00438252,-0*000842790, 0*208489 j-0*0245142 

0 

3*750 

-1*050 

0*356 

0 

0*00815307, 0*0385219, -0*00740806, -0*0245142 | 0*386438 

0*458833 

0*0215710, 0*411631, -0*119544, 0 0913330, 0*143533 

2*17944 


0*125082, 

0 00262354, -0*000904232, 

0*0778153, 

0*00957369 

-0*00989749 

0*00262354, 

0*181511, -0*0425333, 

0*0216326, 

0*0656310 

-0*188870 

-0*000904232, 

-0*0425333, 

0*0488561, 

-0 00585247, 

-0*0152809 

0-0548607 

0*0778153, 

0*0216326, -0*00585247, 

0*212316, 

-0*0184992 

-0*0419066 

0*00957369, 

0*0656310, -0*0152809, 

-0*0184992, 

0*395891 

-0*0658577 

-0*00989749, 

-0*188870, 

0*0548507, 

-0*0419066, 

-0*0658577 

0*458833 
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4*10. Reciprocation by Direct Operations on Rows. This 
method offers the great advantage that no formulae have to be 
memorised.The underlying principle is the reduction of the given matrix 
a to a triangular form and then to the unit matrix by a succession of 
simple operations on rows. These operations are simultaneously per¬ 
formed on the unit matrix, which is ultimately transformed into the 
required reciprocal. The process is thus essentially the same as that 
given in § 4*8, but with premultipliers used instead of postmultipliers. 
However, the actual scheme of numerical calculation is considerably 
different. 

A simple example will make the procedure clear. Suppose the given 
matrix to be 


a 


“36 16 4“ 

15 9 3 

6 4 2 


Then the required reciprocal is such that 


K) 

36 

16 

4" 

a- 1 = 

“1 

0 

0" 

(r 2 ) 

15 

9 

3 


0 

1 

0 

(r 3 ) 

. 6 

4 

2 


0 

0 

1. 


The symbols r t are used in this and in subsequent equations merely to 
identify the rows. Now operate on the left-hand and right-hand rows 
of the equation in the manner indicated below: 


Operation 

ifi 

New Row No. 

U 

"9 

4 

1“ 

a~ x — 

ri 

0 

0 - 


U 

5 

3 

1 


0 

i 

0 

¥9 

U 

.3 

2 

1. 


0 

0 

i 

U “*• 

r i 

'6 

2 

0" 

a” 1 = 

ri 

0 


r 6~ r 9 

r 8 

2 

1 

0 


0 

i 

-* 

— 

U 

.3 

2 

1. 


0 

0 

iJ 


The last column of the matrix on the left now contains only one non¬ 
zero element (namely the unit), and one further operation completes 
the reduction to a triangular form. Thus 


Operation 

¥i-h 


New Row No. 

r 9 

*8 

U 


—1 

»—• 

O 

O 

a- 1 = 

i —i i 

210 


0 i -i 

L 3 2 1 . 


0 0 i 
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The foregoing reduction to triangular form can conveniently be 
arranged as below: 


Operation 

Row No. 

Left Array 

Right Array 

— 


36 

10 

4 

1 

0 0 

— 

'i 

16 

9 

3 

0 

1 0 

— 

'» 

6 

4 

2 

0 

0 1 

to 

U 

9 

4 

1 

i 

0 0 

to 

u 

5 

3 

1 

0 

i 0 

to 

u 

3 

2 

1 

0 

0 t 

r 4 -r. 

'7 

6 

2 

0 

i 

0 -i 

r,-r. 

U 

2 

1 

0 

0 

i -i 

to-»« 

u 

1 

0 

0 

i 

-i i 

— 

u 

1 

0 

0 

* 

-i * 

— 

u 

2 

1 

0 

0 

i -i 

— 

u 

3 

2 

1 

0 

0 i 


The value of | a |, if required, is now given by 
M = (ixixi) _1 = 48. 

The operations on rows can be continued to effect the reduction of a to 
the unit matrix. Thus 


Operation 

Row No. 

Left Array 

Right Array 

r,-2r. 

*10 

0 

1 

0 

-i i -i 

r,-2r. 

'n 

0 

2 

1 

-i i -i 


'i* 

0 

0 

1 

i -i i 

— 


1 

0 

0 

i -i i 

— 

f 10 

0 

1 

0 

-i i -i 

— 

'l* 

0 

0 

1 

i -i i 


The final right-hand array gives the required reciprocal. 

4*11. Improvement of the Accuracy of an Approximate 
Reciprocal Matrix. If an approximate reciprocal matrix has been 
obtained, and greater accuracy is required, the following method is 
usually effective. Let a be the given matrix, p the approximate re¬ 
ciprocal, and p+Sp the exact reciprocal. Then a(p + dp) — I, or 

a dp — 1—ap. 

Premultiplication by p gives, approximately, dp = p(I—ap). Hence 
the next approximation to the reciprocal is 

p+dp = p(2l-ap). 
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The conditions for the convergence of this method of approximation 
are readily obtained. If p = p 0 and p + Sp = p v the last equation gives 

I-ap 1 = I — ap 0 (2I — ap 0 ) = ( I-ap Q ) 2 . 

Similarly, for a second approximation, 

7-ap a = (/-apO 2 = (J—ap 0 ) 4 , 
and generally I-ap r = (I — ap 0 ) 2f . 

Hence provided that p 0 is such that the latent roots of J — ap 0 all have 
moduli less than unity, I — ap r will tend rapidly to zero as r increases, 
i.e. p r will tend rapidly to a -1 . In particular, this condition is satisfied 
if p 0 is a fairly accurate reciprocal of a, so that the elements of / —ap 0 
are all small. In practice, if a very accurate reciprocal of a matrix is 
required, it is probable that labour will be saved and the possibility 
of error reduced by a preliminary calculation of an approximate 
reciprocal and a subsequent application of one or two corrections 
according to the above scheme. 


Example 


If a =* [7 3], then the exact reciprocal is 



Suppose a known approximate reciprocal to be 

p = T 0-998 -3-0051. 
[-1-994 7-013J 


Then 2/-op=T2 01-T1-004 0-0041 = f 0-996 -0-0041. 

[o 2j [o-002 1-003J [-0-002 0-997J 

Hence the next approximation to a* 1 is 


p(2l—ap) = T 0-998 

-3-0051 

' 0-996 

— 0-004' 

[-1-994 

7-013 

-0-002 

0-997. 


1-000018 -2-999977". 
-2-000050 6-999937 


4*12. Computation of the Adjoint of a Singular Matrix. The 

adjoint A of a non-singular matrix a can be obtained by the first or 
the third of the methods of reciprocation already described, and use 
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of the relation A = | a | ar\ Attention will here be restricted to the 
computation of the adjoint of a singular matrix. It will be remembered 
(see §§ 1-10, 1-12) that the adjoint of a simply degenerate matrix has 
proportional rows and proportional columns, while the adjoint of a 
multiply degenerate matrix is null. For many purposes it is sufficient 
to obtain a column of numbers proportional to a non-vanishing column 
of the adjoint, if such a column exists. 

A numerical example will sufficiently explain the procedure, which 
is an adaptation of the method of §4*10. Suppose the given singular 
matrix to be 

a = 2 4 6" • 

3 9 15 

4 16 28 

Let any convenient element, say a 23 (= 15) be given a small increment 
e. Then 


2 

4 

6 

= 

2 

4 

6 + 2 

4 

0 

3 

9 

15+ e 


3 

9 

15 3 

9 

e 

4 

16 

28 


4 

16 

28 

16 

0 


where A 23 denotes the cofactor of a 23 . Hence, if the increment of the 
adjoint due to e is eB, 


’2 

4 

6 ' 

(A + eB) = eA 23 ri 

0 

°1 • 

.(1) 

3 

9 

15 + e 

0 

1 

0 


4 

16 

28 

0 

0 

1 . 



Direct operations on the rows of this equation are now performed, as 
in §4-10, until a row is obtained on the left containing only one non-zero 
element which is proportional to e. Thus 
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The last row obtained yields on division by e/3 (see (1)), 

[0, 0, 1 ](A + eB)-A n [-l 1,-f]. 

Hence when e is made zero 

[0, 0, 1] -4 = [-4 18 , -^23> -^ 33 ] = -^23[ — f> 1* — f]- 
Since A has unit rank, it follows that each row is proportional to 

In practice it is not necessary to introduce the increment e explicitly. 
The operations are performed simply on the matrix a, and are con¬ 
tinued until a null row is obtained on the left. The corresponding row 
on the right will then be proportional to the rows of the adjoint. 

Next, if the transposed of a is taken for the initial left-hand array, 
and the process is repeated, a row will be derived on the right which, 
when transposed, is proportional to the columns of A. The adjoint is 
then determined apart from a scalar multiplier. If required, this 
multiplier can be obtained by evaluation of a non-vanishing first 
minor of a: the calculation is usually simple, since by the preceding 
operations some of the first minors will already have been reduced to 
a triangular form. 

A caution should be added. If during the evaluation of the adjoint 
it is found possible to derive on the left two or more null rows from a, 
corresponding to linearly independent rows on the right, then a is 
multiply degenerate and the adjoint is null. Accordingly, when one 
null row has been obtained it is always advisable to ascertain whether 
another can be derived independently. 

The scheme of computation described can also be viewed as follows. 
Suppose r v r 2 , ...,r n to be the rows of the given matrix a. Since a is 

n 

degenerate, there will be one or more relations of the type 2 ^i r i = 0 

<-1 

between the rows, where the multipliers A { are scalars; the number of 
such relations will equal the degeneracy of a. When a is simply degen¬ 
erate, the row [A V A 2 ,..., A n ] is proportional to the rows of the adjoint 

n 

of a, and in the general case the relation 2 A t r t = 0 is equivalent to 

<-1 

[A v A 2 . A n ] a — 0. Evidently the process described, in which 

various multiples of the rows of a are added until the sum total 
vanishes, amounts simply to a determination of the coefficients A { . 
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Examples 

(i) Simply Degenerate Matrix. The complete calculations for the 
matrix already considered are summarised below: 


Matrix as Given 

Matrix Transposed 

Left Array 

Right Array 

Left Array 

Right Array 

2 

4 

6 

i 

0 

0 

2 

3 

4 

1 

0 

0 

3 

9 

16 

■ 

1 

n 

4 

9 

16 

0 

1 

0 

4 

16 

28 

0 

0 

1 

6 

16 

28 

0 

0 

1 


2 

3 

* 

0 

0 

1 

i 

2 

i 

0 

0 


3 

5 

0 

i 

0 

1 

i 

4 

0 

i 

0 

H 

4 

7 

0 

0 

i 

1 


V 

0 

0 

i 

0 

1 

2 

-1 

1 

0 

0 

i 

2 

-* 

l 

0 

0 

2 

4 

-i 

0 

i 

0 

i 

* 

-i 

0 


0 

1 

2 

-i 

0 

i 

0 

1 


-f 


0 

0 

0 

D 

-i 

i 

-i 

0 

0 

0 

i 

-i 

i 


Hence A = c{£, — £} [— J, — J], where c is an unknown constant. 

Also the cofactor of a 33 is, in the fourth and seventh rows of the above 
table, reduced to triangular form. Hence 

^33 = (lxl)-*-(**£) = 6. 

Alternatively, from the operations on the transposed matrix, 

-433 = (lxf)-(Jxi) = 6. 

Hence c = — 6 x 48 and 

4 “{1,-2, 1} [12, — 16, 6]. 

Note that the row and the column determined have the properties 

[-1, -itt- o. 


(ii) Multiply Degenerate Matrix. If 


a — 



-1 

2 

-4 

-5 



the computations are as follows: 
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Matrix as Given 

Matrix Transposed 

Left Array 

Bight Array 

Left Array 

Bight Array 

2 

-1 

3 

5 

1 

0 

0 

0 

2 

3 

1 

-4 

1 

0 


m 

3 

2 

-1 

4 

0 

1 

0 

0 

-1 

2 

-4 

-5 

0 

1 


0 

1 

-4 

7 

6 

0 

0 

1 

0 

3 

-1 

7 

5 


0 

1 

0 

-4 

-5 

5 

-3 

0 

0 

0 

1 

5 

4 

6 

-3 


a 


1 

1 

-i 

» 

* 

i 

0 

0 

0 

1 

i 

i 

-2 

i 

0 

0 

0 

1 

} 

-i 

4 

0 

i 

0 

0 

1 

-2 

4 

5 

0 - 

i 

0 

0 

1 

-4 

7 

6 

0 

0 

1 

0 

1 

-* 

i 


0 

0 

i 

0 

1 

i 

-t 

i 

0 

0 

0 - 

•i 

1 

4 

* 


0 

0 

0 

i 

0 

i 


-i 

-i 

i 

0 

0 

0 

-i 

i 

7 

-i - 

1 

0 

0 

0 


¥ 

i 

-i 

0 

1 

0 

0 


V 

V- 

-i 

0 

l 

0 

0 

i 

-¥ 

-i 

-i 

0 

0 - 

1 

0 

~ t 7 * 


i 

-i 

0 

0 

t 

0 

1 

-V 

-1 

-f 

2 

T 

0 

0 

0 

1 

-1 

2 

n 




0 

1 

-¥ 

-1 

\ 

0 

-4 

0 

0 

1 

-1 

2 

H 




0 

1 

-V- 

-1 

-f 

0 

0 - 


0 

1 

-l 

2 

♦ 

0 

0 

-1 

0 

0 

0 

0 

f - 

-4 


0 

0 

0 

0 

0 

H - 

•f 

“A 

0 

0 

0 

0 

0 

f - 

-4 

0 - 

•1 

0 

0 

0 

0 

t - 

±_ 

0 

-? 


The process yields two null rows from a, corresponding to linearly 
independent rows on the right, so that the adjoint A is null. The 
following four relations, derived from the rows corresponding to the 
null rows of the left-hand arrays, may be noted: 

[2, - 1 , - 1 , 0] a = 0, 

[1, -2, 0, — 1]a = 0, 
a{5, -11, -7, 0} = 0, 
a{2, - 1 , 0, -1} = 0. 

4*13. Numerical Solution of Simultaneous Linear Algebraic 
Equations. 

(a) Preliminary Remarks on the General Nature of the Solution. 
Before dealing with methods of solution we shall briefly consider the 
general nature of the solution. 

Suppose the given system of m equations in m unknowns a; to be 
ax — h. Then, if a is non-singular, the equations have a unique solu¬ 
tion. On the other hand, if a is singular, the m equations are either 
incompatible or not linearly independent. To make this clear, let us 
suppose a to be of rank r. Then a is equivalent to a canonical matrix C 
which has units in the first r places of the principal diagonal and zero 
elements elsewhere (see § 3-13). We may therefore substitute pCq for a 
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(where p and q are non-singular matrices) and write the given equations 
88 pCqx = h. 

Hence ify = qx and jr x h = Z, we obtain the equivalent set of equations 

Gy = Z. .(1) 

Prom the form of the canonical matrix C it is seen at once that the m 
scalar equations contained in (1) are incompatible unless 

Zr +1 = h+2 = ••• = = 0, 

and that when these conditions happen to be satisfied the solution is 
y i — l i for with y r+1 ,y r+2 , arbitrary. The corresponding 

values of x are in this case given by 

x = Q- X {h> hi • * • > hi Vr+ 1> Vr+2i • • • > ^m}- 

The most general solution of the original equations in x is accordingly 
the sum of a unique column and of arbitrary multiples of m—r other 
columns. For instance, if the equations are 


"2 2 6 3‘ 


x i 

II 

'5' 

6 15 4 


X 2 


5 

4-101 


X 3 


0 

.2 Oil. 


- 1 

■«* 


.1- 


the matrix a is of rank 2, and the most general solution is found to be 
x = {h 2, 0, 0} + a{-£, — 2, 1, (>}+/?{-£, -1, 0, 1}, 

where a and fi are arbitrary constants. 

When a is non-singular and the solution of a system of equations 
ax = h is required with the numbers h left general, the methods of 
reciprocation given in §§ 4-8-4-10 can be used immediately to give the 
required solution x = a~ l h. In the discussion which follows it is 
assumed that the numbers h have assigned values. 

(6) Solution by Method of Postmultiplication. In this method 
operations are performed on columns, and it is necessary to assume the 
equations to be given in the form xa = h, where x and h now denote 
row matrices, and a is a given square matrix. When postmultiplied 
by the n— 1 matrices (see §4-8), this equation becomes 

x(aM 1 M 2 ... M n _f) = x<r = h(M 1 M 2 ... M n ^. 
x = h(M 1 M 2 ...M n _ 1 ) cr -1 . 


Hence 
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The following representative scheme of computation for a system of 
four equations is an obvious modification of the scheme given in § 4*8. 
The essential difference is that, whereas formerly a computation of 
the matrix product was necessary, now only the row 

hM 1 M 2 ... M n _ 1 need be obtained. The calculation of this row is effected 
simultaneously with the reduction of a to cr. 


A, 

A, 

h 


— 

— 

— 

*11 

«1* 

«is 

*14 

*31 

_*33 

*31 

_^4 

*81 


*M 

*28 

*34 

1 

0 

0 

afi 

®M 

*88 

*84 

0 

1 

0 

*41 

°4* 

*43 

*44 

0 

0 

1 


K 

K 


— 

— 



bn 

bn 


A.. 

An 

An 

A„ 



bn 

bn 

a» 

1 

0 



AS 

bn 

bn 

0 

1 




K 

K 

— 

— 




cfi 

Cll 

_<ht 

Cn 





c n 

c M 

1 






A" 







<« 





These calculations give the result 

xct^x a 1:L bn c A * 0 = \h\y h 29 h 3 , A 4 ]. 

a 21 ^21 C 21 ^11 

a 3 * 0 0 0 

_a 4 i 6 3 * 0 0 _ 

Commencing with the last column of a and working backwards to the 
first, we can now derive the values of x 2 , x v x 4 and x 3 in succession. Thus 

•^ 2^11 = ^4 ^ ^2 ~ ^4 n» 

•^ 1^11 + *^ 2^21 = ^3 ^ *^1 = (^3 ^ 21 *^ 2 )> 

, Cl1 
and so on. 

If the entire first column of any left-hand array—for instance 
{^ 3 , Cu, c 21 ]—is null, the given equations are not linearly independent. 
On the other hand, if such a column is null with the exception of its 
first element (for instance, if ^£ + 0 but c n = c 21 = 0 ), then the equa¬ 
tions are incompatible. 
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Examples 

(i) Equations Linearly Independent . As a simple illustration sup¬ 
pose the given equations to be 

x x +3a 3 -h x 4 = 2, 

4^- x 2 + x 8 — 2 x 4 = 1, 

X^ "f* 3*2/2 “i“ ^4 " 1> 

3*Cj x 2 4" 2x 8 “I* x^ = 3, 

or xfl 4 1 31 = [2, 1, -1, 3]. 

0-1 3-1 

3 10 2 
.1 -2 5 1 - 

The calculations which follow should be compared with those in the 
example to § 4-8: 


2 

1 

-1 

3 

— 

— 

— 

1 

4 

1 

3 

2 

-5 

-1 

0 

-1 

3 

-1 

1 

0 

0 

3 

1 

0 

2 

0 

1 

0 

1 * 

-2 

5 

1 

0 

0 

1 


6 

-11 

1 

— — 


6 

-4 

2 

3 

-1 



- I * 

3 

-1 

1 

0 



7 

-15 

-1 

0 

1 




4 

-4 






14 * 

-4 






6 

-8 

1 












_ 

- v * 





Hence fo.a^.arjri 6 14* 0 1 = [2, 5, 4, 


0-1* 

0 

0 


3 7 

6 

“^r* 


Li* o 

0 

0 J 


Accordingly * 3 ( -ty) = -so, 


or 

HB 

ii 

#i(14) — 4 —6#3, 

or 

x i = rt; 


= 5-6x 1 -7x s , or x 2 = -f$, 
x t = 2—x 1 —3x 3 , or x 4 = 
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(ii) Equations not Linearly Independent . As an illustration we may 
take the set of equations (2), which can be expressed as 

-[5, 5,0,1]. 


Then 


£ 

[2 6 

4 

2 1 


2 1 

-1 

0 


5 5 

0 

1 


.3 4 

1 

1 . 

5 

5 

0 

I 

2* 

6 

4 

2 

2 

1 

-1 

0 

5 

5 

0 

1 

3 

4 

1 

1 


-10 

-10 

-4 


-5* 

-5 

-2 


-10 

-10 

-4 


-5 

-5 

-2 


-3 -2 -l 

i o o 
oio 

0 0 I 


-I 

1 

0 


Hence 


[#1, #2, # 3 , £4] p2* 0 0 

2 —5* 0 


= [5,-10, 0, 0], 


0' 

0 

5-10 0 0 

.3 - 5 0 0^ 

so that two unknowns are arbitrary (say x 3 = a and x 4 = /?), while 
x 2 = 2 — 2a — /?, and 2x t — 1 — a — ft . 

The solution is accordingly 

x = [h 2, 0, 0] + a[~£, -2, 1, 0]+ /?[-!> -1, 0, 1]. 

(iii) Equations Incompatible. Suppose 


fo, * 2 > * 3 ] 


Then 


4 

1 * 

1 

-2 


1 2 
1 3 

-2 -1 

1 3 

2 r 

3 4 

-1 7 


= [4,1,3]. 


-7 

1* 

3 


20 

”6*~ 


-2 -1 

I 0 

0 1 


-3 

1 
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The given equations are thus equivalent to 


[x v x 29 x 2 ] 


1 * 0 0 ' 
1 1 * 0 
-2 3 0* 


= [4,-7, 20], 


and they are therefore incompatible. 

(c) Solution by Direct Operations on Bows. For the application of 
this method, which is a variant of that described in § 4* 10, the equations 
are taken in the form ax = h. The initial right-hand array can be taken 
to be the column h , and only single elements are written below on the 
right. If during the process a null row can be formed on the left corre¬ 
sponding to a non-zero element on the right, the equations are incom¬ 
patible ; however, if this element also is zero, the equations are consistent 
but not linearly independent. If q null rows with corresponding zero 
elements on the right can be formed, the general solution will contain 
q arbitrary constants. 

Examples 

(iv) Equations Linearly Independent. The equations are assumed to 
be those given in example (i), and the computations follow the scheme 
of §4-10._ 


Operation 

Row No. 

Left Array 

Column 


'i 

1 

0 

3 

1 

2 


r 2 

4 

-1 

1 

-2 

1 


'a 

1 

3 

0 

5 

-1 


'4 

3 

-1 

2 

1 

3 

r 2 - 4 r x 

's 

0 

-1 

-11 

-6 

-7 

'«-'i 

'e 

0 

3 

-3 

4 

-3 

U - 3r, 

'7 

0 

-1 

-7 

-2 

-3 

3 U + r, 

H 

0 

0 

-36 

-14 

-24 

' 7-'5 

'9 

0 

0 

4 

4 

4 

9 r 9 +r 8 

r io 

0 

0 

0 

22 

12 

^ l 2 r 10 

'u 

0 

0 

0 

1 

<! 

TT 

i r » _r u 

'n 

0 

0 

1 

0 

A 

r,+2r„ 

'l3 

0 

-I 

-7 

0 

-IT 

'i-rii 

'l4 

1 

0 

3 

0 

15 

r u + 7r„ 

'lB 

0 

-I 

0 

0 

11 

r w ~ 3 r i» 

'ie 

1 

0 

0 

0 

* 

'if 

— 

1 

0 

0 

0 

I 1 ! 

“'is 

— 

0 

1 

0 

0 

-1! 

'l» 

— 

0 

0 

1 

0 

* 

'll 

— 

0 

0 

0 

1 

* 
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Hence # 2 » *^ 4 } “ {lV* if’ ii» yi}* 

(v) Equations not Linearly Independent. If the equations are those 
of example (ii), the scheme of computation is 


Operation 

Row No. 


Left An ay 


Column 



2 

2 

5 

3 

5 


r ‘i 

0 

l 

«> 

4 

5 


r. 

4 

-1 

0 

I 

0 


*4 

2 

0 

I 

1 

1 

r z - 3r, 

u 

0 

- 5 

- 10 

- .) 

- 10 

r z ~ - r i 

r t , 

0 

- 5 

- 10 

- 5 

- 10 

f 4 ~ r \ 

r 7 

0 

-2 

-4 

2 

-4 

“ r 5 

r H 

0 

0 

0 

0 

0 

5r 7 - 2r 5 

r 9 

0 

0 

0 

0 

0 


Since two null rows are derived, only two of the original equations are 
linearly independent, and these may be chosen to correspond to r x 
and r R . Thus 


2 2 

5 

3 % 2 ’ *^ 4 } — 

5 

0 1 

2 

lj 

2 


Two of the unknowns may be assigned arbitrarily, say x z — a and 
# 4 = /?. The equations can then be written 


21 [x x +ar5‘ 
1 .^ 2 - A 


+fi\ 


and the calculations are continued as previously. 


Operation 

Row No. 

Left Array 

Column 


r io 

2 

2 

5 - 5a - 3/3 


'n 

0 

1 

2-2a-0 

kw-ni 

r i2 

1 

0 


r n 

— 

0 

1 

2 - 2a — /J 


Hence {x v x 2 } = {£, 2}-a{J, 21}, 

or {x 1; x 2 , x 3> x t } = {§, 2, 0, 0} + a{--2 , 1, 0} + /?{-|, -1, 0, 1}. 

(vi) Equations Incompatible. Taking the same equations as those in 
example (iii), we obtain 


Operation 

Row No. 

Left Array 

Column 



1 1 -2 

4 



2 3 -1 

1 


*3 

1 4 7 

3 

r 8 - 2r i 

*4 

0 1 3 

-7 

r a r, 


0 3 9 

-1 

>•5 - 3ci 

*0 

0 0 0 

20 


Row shows that the equations are incompatible. 
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(d) Iterative Methods of Solution . In certain cases it may be possible 
to apply successfully an iterative method of solution. Denote the 
equations as ax = h , and let a = v 4- w, where v is chosen to be a non¬ 
singular matrix which readily admits inversion. The exact solution 

then 18 * = a-'h = (1 + u-%)- 1 v-'h. 

If— v~hv and g = v^h, the solution may be developed as 

x = (I+f+P+P + —)9> 

provided that this matrix series converges. The conditions for con¬ 
vergence are satisfied if the moduli of the latent roots of /—namely, 
the roots of the equation | AI — / | = 0—are all less than unity. This 
equation can be written alternatively as 

| v\ + w | = 0. 

When the foregoing conditions for convergence are satisfied, the suc¬ 
cessive approximations to x , say z(0), x(l), x(2), etc. may be taken as 

x(0) = g , 

x(l) = (I +f)g =g+fx(0), 
x(2) = (I+f+f 2 )g = g+fx( 1), 
and generally x(r+ 1) = g+ fx(r). 

If an approximate solution is already known, this will naturally be 
used as the first approximation in place of #(0) = g\ the computations 
are thereby shortened. 

Two methods of solution based on this principle have been developed. 
A tabular method due to Morris is effectively the same as that 
described, with the matrix v chosen to have zero elements to the right 
of its principal diagonal and the same elements as a elsewhere; thus w 
has zero elements in and to the left of its principal diagonal, and the 
same elements as a elsewhere. Since v is triangular, its reciprocal can 
readily be found as in § 4*5. Morris has applied his method successfully 
to the solution of sets of linear equations such as arise in certain struc¬ 
tural problems.* In these problems the matrix a is symmetrical, and 
has its origin in a quadratic form which has all its discriminants 
positive. It may be noted that it is possible to convert any system 
of linearly independent algebraic equations ax = h to a system of the 
foregoing type by premultiplication by the matrix a' (see §§ 1-13,1*16). 
* R«f. 9. For such problems the method is always convergent. 
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In the second method the matrix v is chosen to be diagonal or more 
simply the unit matrix. In certain types of statical problem the 
equations arise naturally in the form x = g +fx, and provided the 
latent roots of / are all less than unity, the process described, with 
v = I, provides the solution. An illustration is given in § 10-9. 

Example 

(vii) Morris' Type of Solution. Let the equations be 
'25 2 11 pEjl = r691, 

2 10 1 x 2 63 

.1 1 J UJ 1.43. 

for which the exact solution is x = {2,5,9}. Then 

v = T25 0 01; tr 1 = V 0 04, 0 1; 

2 10 0 -0-008, 0-1, 0 
.1 1 4j L-0-008,-0-025,0-25 

/ = - v~hv = [0, - 0-08, - 0-04 ] g = v~% = T2-7601. 

0, 0-016,-0-092 6-748 

.0, 0-016, 0-033J 18-623 


Hence the iterative process yields 


*(0 )=g 

M0) 

*(i) 

MU 

z(2) 

M2) 

*(3) 

M3) 

1(4) 

M*) 

m 

2-760 

6-748 

8-623 

-0-7013 

0-3765 

5-0467 

8-9995 

-0-7472 

0-3777 

5-0008 

9-0007 

-0-7481 

0-3770 

4-9999 

9-0000 

-0-7480 

0-3770 

5 0000 
9 0000 

-0-7600 

-0-7480 

0-3770 

20000 
5 0000 
9-0000 


It may be noted that until the final step is reached it is unnecessary to 
compute the leading element in x(r), since this is always multiplied 
by a cipher in the iteration. 


Part II. High Powers of a Matrix 
and the Latent Roots 

4-14. Preliminary Summary of Sylvester’s Theorem. A high 
power of a matrix in general approximates to a relatively simple form, 
and the dominant latent roots—namely, the roots of greatest modulus 
—can be found from this limiting form. The method of calculation* is 
based on simple applications of Sylvester’s theorem. For convenience 
of reference the simple and the confluent forms of this theorem will 
first be restated from §§ 3-9 and 3-10. 

* Refe. 10,11. 
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Simple Form of Sylvester's Theorem. If P(u) denotes any polynomial 
of a square matrix u of order n, and if all the latent roots A x , A 2 ,A n 
of u are distinct, then 


r-1 


where Z 0 (\) denotes the square matrix 
(i) (i) 

^(K)IMK)—K K r l&(K) = n (A jl-u)! n (A i ~A r ), 

i+r j*r 


( 1 ) 

and A(A r ) is an abbreviation for 



Confluent Form of Sylvester's Theorem. If the latent roots are not 
all distinct, the terms contributed to P(u) by a typical set of s equal 
roots A x , A 2 ,..., \ 8 are given by 


T(K) = 


i r d*- 1 P(A) F(\n 
la-lUA*- 1 A S (A) L a ; 


where A*(A) s (A — A a+1 ) (A — A a+2 )... (A — A n ). An alternative is 

(i) («-D 

n A.) = P(K) Zs-i(K )+-j^ ZUK) + • • •+£zy z o(K), 


•(i) 


in which 

for i = 0,1,— 1, and 


(i) 

pm 


= WJ\ 

“l d* La/ 


Any matrix coefficient of the type Z 0 (A S ) either has unit rank or is 
null. It is of unit rank, and therefore expressible as a matrix product 
Jc 8 k s , when the characteristic matrix /(A) = A/ — u is simply degenerate 
for A = A s . In other cases it is null. 


4*15. Evaluation of the Dominant Latent Roots from the 
Limiting Form of a High Power of a Matrix. The limiting 
form of a high power of a matrix depends upon the number and the 
nature of the dominant latent roots (see also examples (i) and (ii) of 
§ 3-9). Some of the possible cases are discussed below. 

(i) Single Dominant Real Root. If there is a single dominant root, 
say A x , the general form of Sylvester’s theorem shows that when 
P(u) = u m and m is large x , 


( 1 ) 
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It will be convenient to express this as 

^ = AfZ 0 (A 1 ), 

where == denotes approximate equality. If E m is an element occupying 
a particular position in u m } it follows that 

E m+ll E m-+K- .( 2 ) 


(ii) Repeated Dominant Real Root . Next, let the roots of greatest 
modulus consist of a set of s equal roots A x , A 2 ,A 8 . Choose 

P(u) = u m P 0 (u), 

where P 0 (u) is a polynomial of u independent of m. When m is large 


A - m u m P 0 (u)-> 


1 A- 

r d »-i A m P 0 (A) P(Af] 


L**- 1 ' A.(A) J 


(3) 


If P 0 (u) = (A S I — u) 8 , all the terms in the expanded form of the expres¬ 
sion on the right of (3) vanish, so that 


A7^ m (A s /-^->0. 

Accordingly, if again E m is an element having a selected position in 
u m , then for m large A s will be given approximately by the equation 


E m*l - sE m +i K- 1 + E m+2 AJ- 2 +... + (-1)« E m+g = 0. 

.W 

Only one of the roots of this equation is the true value of A 8 . The true 
root is readily determinable, since it will be common to (4) and the 
similar equations which can be obtained from elements having other 
selected positions in u m . 

When the characteristic matrix has multiple degeneracy for A = A 8 , 
equation (4) can be replaced by one of lower degree. Suppose the 
degeneracy to be q( > 1): then i^(A) and the derived adjoint matrices 

(Q- 2) 

up to and including F(\) are all null for A = A s (see Theorem (E) of 
§ 3-5). Accordingly, if in (3) we choose P Q {u) = (A a / - u) 8 ^ 1 , the terms 
in the expanded form of the expression on the right again vanish, and 
the final equation for A 8 is 

- (« - g +1) +... 

+ ( ~ l ) S _ a + 1 -® m +«-®+ 1 = 0 - 

In particular, when q = s, the root A a is given by 

E mK~ E m +1 = °* 
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(iii) Dominant Roots a Conjugate, Complex Pair. Suppose the roots 
of greatest modulus to be A \ — p+io) and A 2 = p—icj.. Then if as 
before P 0 (m) is a polynomial independent of m, 

n m P 0 (u) = A”P 0 (A 1 ) Z 0 (A 1 ) 4- Ag*P 0 (A 2 ) Z 0 { A 2 ). .(6) 

When P 0 (u) = (A x I — u) (A 2 I—u), 

(6) gives « m (A x I—u) (A 2 /— u) = 0, 

or, approximately, 

C« 2 + to 2 ) E m - 2pE m+1 + E m+2 = 0. .(6) 


The values of fi and co can be found from (6) and a companion equation 
which is either the next equation in the same sequence or the corre¬ 
sponding equation for some other element of the matrix. 

Other possible cases arise when the dominant roots embrace several 
distinct roots or several distinct sets of equal roots. When required, 
the formulae appropriate to such cases can be constructed by the 
methods already exemplified. The nature of the dominant root or 
roots is in general disclosed by the manner in which E m+1 IE m varies 
with m. If this ratio quickly converges to a constant, then the dominant 
root is real and equal to this constant. If the ratio does not change sign 
but is very slowly convergent, there are two or more equal or nearly 
equal dominant roots. On the other hand, if the ratio does show 
changes of sign when m is large, then the dominant roots are complex. 

Simplified methods for obtaining the latent roots are given in § 4*17. 

Examples 

(i) Single Dominant Real Root . Suppose 

u = p 4 61. 

3 9 16 

4 16 36 

Here, by direct multiplication, 

u* = [ 72220 264188 5578201, 

173289 6,33957 1338633 

.388448 1421248 3001248. 

tt 5 = T 3168284 11591692 244776601. 

7602981 27816897 58739877 

.17045632 62364992 131694336. 
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The ratios of the elements of u s to the corresponding elements of w 4 are 


43-8699, 

43-8746, 

43-8814 


43-8767, 

43-8782, 

43-8804. 


43-8809, 

43-8806, 

43-8799. 


By (2) these ratios, whose mean is 43-8781, should give an approximate 
value for the dominant root. The actual latent roots are 43-8800, 
2*71747, and 0-402541, so that although the powers of u employed 
are very low, the approximation to the dominant root is quite 
good. The separation of this root from the adjacent root is, however, 
unusually great in the present example. Generally, the approximation 
is not so rapid. 


(ii) Once Repeated Dominant Real Root. The typical equation giving 
the latent root in this case is (see (4)) 


Suppose 


E m A? — 2E, 


u = 


'm +1 

4 

2 

3 


^ + E m +2 

1 2 1 ’ 
4 -3 

1 3 


= 0 . 


of which the latent roots are 5, 6, 1. It is found that 


tt® = 


962149 625000 634761 , 

-463868 -234375 -161132 
952148 625000 63477. 


v? = 


’ 5249024 3515625 2197261, 

-2807618 -1562500 -708007 
5249023 3515625 219727. 


u 10 = 


’ 28686524 
-16479493 
28686523 


19531250 610351' 
-9765625 -3051757 
19531250 610352. 


The leading diagonal elements give 

952149A 2 -2(5249024) A+28686524 = 0, 
so that A = 5-000008 or 6-025628. 


Similarly the last diagonal elements yield 

A = 4-999966 or 1-923078. 

Hence A = 5. 



138 


CONJUGATE DOMINANT ROOTS 


4 - 15 - 4-16 


(iii) Unrepeated Conjugate Complex Dominant Roots. Let 


u = 


[26 -54 
13 -28 


L26 -56 


4], 

3 

5 


of which the latent roots can be shown to be 1 and 1 ± 5 i. Here 


u 8 


’-2280668 4933944 -372606], 

-1184118 2645932 -277695 

-2372214 5117340 -372911 


u 9 


’-4843852 5871576 3816130], 

-3610022 5407196 1812849 

.-4847830 5697052 3998609. 


u 10 = 


'49609716 -116539400 
23567050 -57979844 
.51981930 -121656740 


17319970] . 

10845745 

17692881. 


If E is the first element in the top row, equation (6) becomes 
- 2280668(/* 2 + oj 2 ) + 2(4843852) p + 49609716 = 0, 
while if E is the second element in the top row 

4933944(/* 2 + w 2 )- 2(5871576)/* -116539400 = 0. 

These equations yield p 2 + (i) 2 = 25*999977, and 2p = 1*999979. Hence 
[i = 1 and o) = 5 with an error of less than 1 in 10 4 . 


4-16. Evaluation of the Matrix Coefficients Z for the 
Dominant Roots. It is sometimes required to calculate not only the 
latent roots but also the corresponding matrix coefficients Z in Syl¬ 
vester’s expansion for P(u). In dynamical applications, for instance, 
these coefficients effectively determine the modes of the constituent 
motions. It will now be shown how the matrices Z appropriate to the 
dominant root or roots can be found, once these roots have been 
determined. 


(i) Single Dominant Real Root . If A x is the dominant root, then by 


(4*15*1) 


\j m u m -+Z 0 (\ 1 ). 


(ii) Repeated Dominant Real Root . Suppose firstly that the dominant 
roots consist of two equal roots A x , A 2 . In this case, when P(u) = u m 
and m is large, (4*14*1) gives 


A^ m+ 1 -u m -> A 2 ^i(A 2 ) + mZ 0 (\ 2 ). 


(1) 
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The next relation of the sequence is 

\-m u m+l ^ A 2 ) + (w + 1) Z 0 (A 2 ), 

so that — \ 2 m u m (X 2 I- u)-+Z 0 (\ 2 ), .(2) 

and A^ m - 1 « m [A 2 / + w(A 2 /-M)]->Z 1 (A 2 ). .(3) 

When A 2 / — u is doubly degenerate, Z 0 (A 2 ) is null, and then (1) gives 
aton “ 

The derivation of the appropriate formulae is rather more trouble¬ 
some if the roots of greatest modulus consist of s equal roots \ v A 2 , 

A s . In this case, taking P(u) = u m P 0 (u) and assuming m large, we obtain 
by (4*14*1) 

ZUK) +f* A '" p o(A)) Z a _ t ( A s ) +... 


U 


+ 




To isolate any particular matrix coefficient, say Z^AJ, it is necessary 

d p 

to choose P 0 (u) in such a way that (A m P 0 (A)) vanishes when A = A s 

dA p 

and p=t=6* — i— 1, but does not vanish when A = A s and p = s — i— 1. 
It can be verified that the appropriate polynomial P 0 (u, i) in this case 
is given by 

i- <,)■-<-■ [a; i+ mA;-v, i - u) +a;-*(a, /-«)*+... 

+ M m +\). (m t i -jj (A/ 

l! -1 

where i s — 1 . The matrix Z t ( A s ) is then given by the formula 
( _ iy-i-'A;rn-i u ,np o(Ui f) Z ;(A S ). 

As for the once repeated dominant root, the formulae can be 
simplified if A S I — u is multiply degenerate. 

(iii) Dominant Boots a Conjugate Complex Pair. In this case, when 
P Q (u) = A 2 7 — u in (4*15*5), 

^(A 2 I - *0/A?(A 2 - A x ) -> Z 0 ( A x ). .(4) 

In general Z 0 (A*) is complex, and Z 0 ( A 2 ) is its conjugate. 
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4*16-4-17 


Examples 


(i) Single Dominant Real Root. Assuming u to be as for example (i) 
of § 4*15, and applying (4-15-1) with m = 5 and A! = 43-8781 (the mean 
computed dominant root), we obtain the approximation 


UK) - AlW 


0-0194799 

0-0467463 

0-104804 


0-0712705 

0-171030 

0-383446 


0-150499' 

0-361157 

0-809711. 


The value of Z 0 (\ x ) computed directly from J'(A 1 )/A(A 1 ) for the true 


root is 


'0-0194753 0-0712545 0-1504661. 
0-0467356 0-170992 0-361078 

0-104781 0-383361 0-809532. 


(ii) Once Repeated Dominant Real Root. Suppose u to be the matrix 
in example (ii) of §4-15, and apply formulae (2) and (3), with m — 8 
and A 2 = 5. The approximate results work out as 


UK) = 


1-249994 1-000000 -0-2499941, 

-1-249992 -1-000000 0-249992 

1-250004 1-000000 -0-250004. 


and 


UK) = 


0-437511 

0-812485 

L0-437492 


0 0-5624891, 

1-0 -0-812485 
0 0-562508. 


while the accurate values are 


and 


UK) = 

“ 1-25 

1-0 -0-25' 

f 


—1*25 

-1-0 0-25 



1-25 

1-0 -0-25. 


^ 1 (^ 2 ) ~ 

[0-4375 

0 0-56251 


0-8125 

0-4375 


1-0 -0-8125 
0 0-5625J 


4*17. Simplified Iterative Methods. It has been shown in 
§§4-15 and 4-16 that the dominant latent roots of a matrix u and the 
corresponding matrix coefficients Z can be evaluated by raising u to 
a high power. The same result can be achieved, and usually with less 
labour, by repeated premultiplications of an arbitrary column by u, or 
alternatively by repeated postmultiplications of an arbitrary row by u. 
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To illustrate the method, consider firstly the simple case where 
there is a single real dominant root A r Let #(0) be an arbitrary column 
of n elements. Then by (4* 15*1) 

u m x(0) =? Af Z 0 (A 1 ) a;(0). 

This may be written as 

(i) 

u m x{ 0) = Affc 1 /c 1 a;(0)/A(A 1 ), 

(i) 

or, if O denotes the scalar factor /c 1 a:(0)/A(A 1 ), as 

u m x( 0) = AfO^. 

Accordingly, unless #(0) happens to be so chosen that O vanishes, A x 
is given by the ratio of corresponding elements in u m+1 x( 0) and u m x( 0). 
The formula (4-15-2) is therefore still applicable if E m represents one 
of the elements of u m x( 0). Moreover, the modal column k x is propor¬ 
tional to u m x( 0). 

Again, if y( 0) denotes an arbitrary row of n elements, then 

y(0)u m = A?YK v 
(i) 

where Y = f/(0) kj A(A X ). 

Hence continued postmultiplication of y( 0) by u yields a row, the 
elements of which are proportional to the corresponding elements of 
k v Also A 2 can be derived as before. 

A further case of importance is where the dominant roots are 
conjugate complex and unrepeated. Equation (4-15*6) is clearly still 
applicable if E m is interpreted to mean an element of the column 
C m -=zu m x(0), and in this case by (4-16-4) 

(i) 

Zo( K) m S k^xiomx,) = (A 2 C m - C m+1 )/AJ*(A 2 - A x ). 

The modal column k x is thus proportional to 

A 2 C' m ~C , m+ i. 

Similarly, if E m is an element of the row R m ^=y(Q)u m , the row k x is 
proportional to . D D 

Another application of the iterative method, which may be 
mentioned here, is to the construction of the characteristic equation 
of a matrix u. If this equation is written 

A n +p 1 A n “ 1 +p 2 A n “ 2 -f ...+p n = 0, 
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then by the Cayley-Hamilton theorem 

u n +p l u n ” 1 -bp 2 u n ^ 2 f... +p n I = 0. 

Postmultiplying this equation by an arbitrary column #(0) and writing 
ux( 0) = #(1), u 2 x( 0) = ux( 1) = z(2), etc., we obtain 

x(n)+p 1 x{n--\)+p 2 x(n--2.) +...+p n x(Q) = 0. 

This yields n simultaneous scalar equations for the n unknown 
coefficients p. 

Examples 

(i) Determination of the Dominant Latent Root, Consider the matrix u 
of example (i) of §4-15. If {0,0,1} is chosen as an arbitrary column, 
the first premultiplication gives 


’2 4 6" 


"0" 

= 

‘ 6" 

3 9 15 


0 


15 

4 16 36 


.1. 


.36 


It is convenient, as each new column is obtained, to extract a scalar 
factor so as to reduce a certain element (say the last) to unity. The 


iterative process can then be tabulated as follows: 


Initial 

Iteration number 

column 

1 

2 

3 

4 

5 

0 

6 

01667 

800 

01846 

8-15 

0-1859 

8-16 

0 1860 

8-16 

0-1860 

0 

15 

0-4167 

19-25 

0-4443 

19-55 

0-4458 

19-57 

0-4460 

19 57 

0-4460 

1 

30 

10 

43-33 

10 

43-85 

1-0 

43-88 

1-0 

43-88 



The fifth iteration repeats the fourth. Hence 


[2 4 6] 


ro-1860] 

= 43-88 

[0-1860] 

3 9 15 


0-4460 


0-4460 

.4 16 36 


.10 


.10 . 


and 43-88 is therefore the dominant latent root, and (0-1860,0-4460,1-0} 
is the associated modal column. 


(ii) Construction of the Characteristic Equation . The characteristic 
equation of the matrix used in example (i) can be found by the direct 

iteration w {o, o, 1} = {6,15,36}, 


u{ 6,15,36} = {288,693,1560}, 
«{288,693,1560} = {12708,30501,68400}. 
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Hence 

■12708" 

+Pi 

' 288" 

+Pt 

■ 6‘ 

+Ps 

'O' 


30601 


693 


15 


0 


68400. 


.1560. 


.36. 


.1. 


or 

■ 288, 6,0' 

Pi 

= 

--12708" 


693,15,0 

Pi 


- 30501 


.1560,36,1. 

J>3- 


.-68400. 


The solution, which is readily found by any of the methods of § 4-13, is 

{P V P 2 > Pz) = { - 47 > 138 > ~ 48 }* 

The characteristic equation of u is therefore 

A3_47A 2 + 138A-48 = 0. 


4*18. Computation of the Non-Dominant Latent Roots. 

When the dominant latent root or roots have been found, the remaining 
roots can be obtained successively in the order of their moduli by an 
extension of the methods already explained. The essence of the exten¬ 
sion consists in the construction of a modified matrix v which possesses 
all the latent roots of u except the dominant roots. 

For simplicity, suppose there to be a single dominant root \ v and 
assume that A x and the corresponding row k x have already been 
calculated. Then, if X 8 is any other root, we have by (3*8*11) 


*iK = 0 . 


(1) 


Let K rl be any non-zero element of k v and denote as w the square matrix 

which has for its rth row and its remaining n — 1 rows all null. 

Then in view of (1) we can write k 8 = (I — w)k 8 . But (\ 8 I — u)k a = 0: 

hence also , ^ T w A /ox 

(\ 8 I-v)k 8 = 0, .(2) 


where 


V = u{I-w)=U- — {u ir }K v 


.(3) 


From (2) it is clear that A 2 , A 3 ,..., A n are latent roots of v, and that 
A? 2 j k 3) k n are the corresponding modal columns* of v. The remaining 
latent root of v is A = 0: this follows from the fact that the rth column 
of v is null, as is obvious from (3). When v has been obtained, the 
dominant latent root of this matrix (which is necessarily the sub¬ 
dominant latent root of u) can be calculated by the methods of §§4*15 


* It is also easily shown that the sth row of the matrix k appropriate to t; is given by — - — . 

K ri K rt 
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4*18 


or 4* 17. Evidently all the latent roots of u can be found successively 
by a continuation of the process. 

An alternative procedure in the calculation of the subdominant root 
is to omit the rth column and the rth row of v, and to apply the iterative 
method to the matrix of reduced order so derived. When this treatment 
is adopted the typical modal column k 8 will be determined apart from 
the element k r8) which will have to be calculated independently from (1). 

If there are two distinct dominant roots A x and A a , and if K rV Kgt(r*q) 
are, respectively, non-zero elements of k x and x 2 , the matrix w must be 
constructed to have KjK rl and K 2 / K q 2 f° r it® rth and gth rows respectively, 
and to have its remaining n — 2 rows all null. The matrix v = u(I — w) 
in this case has its rth and gth columns null, and has therefore two zero 
latent roots. The remaining latent roots are A 3 , A 4 ,..., A,,, and the 
corresponding modal columns are k 3 , k it lc n . The method of extension 
to cases of three or more distinct dominant roots will be obvious. 

It may be noted that in the special case of a complex pair of dominant 
roots Aj, A 2 , since k 1 is proportional to the row A 2 ii m — R m+1 , and 
similarly k 2 is proportional to the row A x ^ m —f? m+1 (see §4-17), the 
two equations corresponding to (1) yield R m k a — 0 and R m+X k a = 0. 
Hence the elements E can be used directly in place of the elements k 
in the construction of the matrix v. 


Example 

Computation of the Subdominant Latent Root. The method will 
be used to determine the subdominant latent root of the matrix u of 
example (i) of §4-17. 

Firstly, repeated postmultiplication of an arbitrary row by u yields 

[0-1294,0-4736,1-0] [2 4 61 43-88 [0-1294,0-4736,1-0]. 

3 9 15 

.4 16 36. 

Hence k x = [0-1294,0-4736,1-0], and choosing r = 1 in (3), we obtain 

v = [2 4 61 - [21 [1,3-660,7-728] 

3 9 15 3 

.4 16 36J [,4. 

= TO, -3-320, -9-4561. 

0,-1-980, -8-184 
.0, 1-360, 5-088. 
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The results of the iterative process, with v as premultiplier and {0,0,1} 
as an arbitrary column, are given below. It should be noted par¬ 
ticularly that, except in the last step, the top elements of the columns 
need not be calculated. 


Ttiitaxl 

Iteration number 

oolumn 

1 

2 

3 

■9 

-8*184 

-1*808 

-5 000 

-1*724 

-4*770 

-1*739 

u 

5*088 

1*0 

2*901 

1*0 

2*743 

1*0 


Iteration number 

4 

5 

6 

-4*741 

2*723 

-1*741 

1*0 

-4*737 

2*720 

-1*742 

1*0 

-3*673 

-4*735 

2*719 

-1*361 

-1*742 

1*0 


Since the sixth iteration repeats the fifth, the top element is computed, 
and the value 2-719 is deduced for the latent root. The corresponding 
modal column is proportional to { —1-361, —1-742, 1-0}. 

4*19. Upper Bounds to the Powers of a Matrix. Formulae 
giving upper bounds to the powers of a matrix are sometimes required, 
and we shall now obtain some simple formulae of this type. For 
definiteness, suppose u to be a square matrix of order 3 with real or 
complex elements, and write the typical element of u* as 

where U if (8) is the essentially positive modulus, and a has unit modulus. 
The value of a in each case will not be required, so that the suffices 
which should strictly be associated with this symbol are omitted. 
Further, denote as r it c i the sum of the moduli of the elements in the 
ith row and the jth column, respectively, of u, and let R i9 Cj be, re¬ 
spectively, the greatest modulus in the ith row and jth column. 

Then 

<*u(2) - U 11 (l)xU n (l)*+ U n (l)*U n (l)*+ U^(l) *U 31 (1) a, 
so that U n ( 2)< E£(l)+ U n ( 1) tyiR U Z1 ( 1). 


Hence ^n(2) and also The remaining elements of u 2 

may be treated on a similar basis. It follows that the moduli of the 
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elements of u 2 are all less than the corresponding elements of both 
the matrices 



rxO* 

r iC a 

and B= 


R x c 2 

Ri c z 

r 2 Ci 

r 2 C 2 

r 2 C 8 



B 2 c 2 

R%c z 

- r 3^i 

r 3 C 2 

r*C a . 



B 3 c 2 

-®3 C 8- 


Proceeding next to u 3 , and using A , we have 


u z = 

r x C 2 0 r x CJ' 


'U n {l)a U lt (l)a V a (l)a 


r 2^lP r 2^2^ T 2^zP 


^4i(l) a G 2i (l)a C4s(l) a 


- r 3^lA T 2p2p r 2pzP- 


.U al (l)a f/ 32 (l)a 


where /? represents a quantity of modulus not exceeding unity. Hence, 
for instance, f/ n (3) < (C x + C 2 + C 3 ) r x C v and ?7 al (3) ^ (C x + C 2 + C 3 ) r 2 C v 
This shows that the moduli of the elements of v? are dominated by the 
corresponding elements of (C 1 + C 2 -{-C 3 ) A, or briefly u* is dominated 
by (G x + C 2 +C Z )A, 


Again, using B , we obtain 


V? = 

'f/ u (l)a U Vi [\)a U 13 ( 1 ) a 


R\C x fi R x c 2 f} R x c z fi 


U n (l)cc iy l)a C4s(l)a 


R 3 ^\P Ri^fi R 2 C 2 P 


-^3i(i) a G 32 ( i) a £4s(i) a J 


-R 3 C\P R^zP R^sP- 


Consequently Z7 n (3) ^ (R x +R 2 + R 3 ) R x c x etc., so that v? is also domi¬ 
nated by (R x +R 2 + R 3 ) B. 

The argument can be extended to show that u 8 is dominated by the 
two matrices 

A 4 = (C x + C 2 + C z ) 8 ~ 2 A, 

B 8 = (R X +R 2 +R 3 ) 8 - 2 B. 

The corresponding theorem for a square matrix u of general order n 
is as follows: Let r {f c i be the sum of the moduli in the ith row and the 
jth column of u, respectively, and R { , C i be the greatest modulus in 
the ith row, and in the jth column, respectively. Then the moduli of 
the elements of u 8 will not exceed the corresponding elements in the 
two matrices 


A t s(C 1 +C a +...+ C n Y~* {r v r 2 ,r n } [C lt C 2 ,..., C n ], 
(R x ++... + i? n )* -a {iJ|> R a >..., JS n } [<5j,c 2 , ...,c n ], 
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where 8^2. Clearly, for the upper bound to the modulus of any specific 
element of u 8 the smaller of the two corresponding values given by 
A s and B a can be selected. 

From these results another upper bound can be derived which, 
though considerably cruder, is often sufficient in the discussion of the 
convergence of a matrix series. Thus, if U is the greatest modulus in 
u, then r { < nU and < nU. Hence u 8 is a fortiori dominated by 

n*-*U-*{nU,nU,...,nU)[U 9 

An upper bound to the modulus of any element of u 8 is therefore 

n^U 8 . 


Examples 


Take 



Then 

r*i = 3, c 1 = 4, 

<N 

II 

fH 

05 

= 3. 


r 2 = c 2 = 5, 

*2 = 3 , 

C z = 3. 

Hence 

7*3 = 3, c 3 = 4, 

* 3 = 2 , 

II 

to 


A 6 = 8*{3,7,3}[3, 3,2] = 

"36864 

86016 

36864 

36864 24576“ 

86016 57344 

36864 24576 


~ 7 4 {2, 3, 2} [4, 5, 4] = 

"19208 

28812 

.19208 

24010 19208“ 

36015 28812 

24010 19208 


In this case B e offers the smaller bounds to the moduli of the elements 
in u*. In actual fact 


u* = 


553, - 948, 600' 

— 2322, 4123, —2670 

1422, — 2496, 1627 


It may be noted that the bounds given by B e are much lower than 
the value 177,147 (= 3 6 3 6 ) obtained from n^U 8 . 
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Part III. Algebraic Equations or General Degree 


4*20. Solution of Algebraic Equations and Adaptation of 
Aitken’s Formulae. It is easy to verify that the equation 


X n +jpjA n 1 + ... +JJ n _iA+2) n = 0 


is the characteristic equation of the matrix 



L Pn’ Pn- 1> Pn- S> 


~Pi, -PlJ 


( 1 ) 


The methods of Part II can be used to obtain the latent roots of u, 
and therefore the roots of (1). 

It has been shown in example (vii) of § 3-7 that, for any square matrix 
u whose characteristic equation is (1), the elements E having a specified 
position in the matrices Y(a) = u 8+n satisfy the difference equation 


E(8)+p 1 E(8-l) + ...+p n E(8-n) = 0.(2) 

Consider firstly the simple product ux( 0), where x(0) is a column of 
arbitrary elements x v x 2 ,..., x n . This yields 

UX{0) = •••> 

where - x n+1 = Pl x n +p 2 x n _ 1 +... + +P n *i . (3) 

The value of x n+1 given by (3) is precisely the value of E(a) given by the 
difference equation (2) if 

E(a-1) = x n , E(a—2) = x n _ v E(a-n) = x v 

A second premultiplication by u leads to {x 3 ,x t , ...,a: n+a }, where 
* n+I is obtained from x n+v x n , ...,x t in the same way that x n+1 is 

obtained from x n , x^ . x v Hence repeated premultiplication of 

*(0) by u generates a single sequence of elements x which are the 
successive terms in that solution of the difference equation which 
begins with the set of numbers x v x s ,x n . The theory given in Part II 
shows that the do min ant root or roots of (1) can be obtained from the 
members of this sequence. When the dominant root is real its value 
is the limit of x p+l jx p when p is large. This is the method of solution 
invented by Daniel Bernoulli in 1728. 
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Bernoulli’s method of solution of algebraic equations in its original 
form is only applicable when the dominant root is real, but an extension 
has been developed by Aitken in which this restriction is removed.* 
Aitken gives a number of useful formulae, and, as these are all con¬ 
sequences of the fact that x, is given by an expression of the form (3), 
they can be immediately adapted to the case where the sequence of 
elements results from repeated premultiplication of an arbitrary 
column by any square matrix u. The most useful of these formulae can 
be written r\ in /it 


where 


Aj A 2 • • 

• Ap = 


4 

4+i 

••• 4+p-i 

4+1 

4+2 

— 4+p 

4+j>-i 

4+j> 

••• -®«+2p-2 


and E r represents the value of an element in some arbitrary fixed 
position in u T , u T x, or yu r . It is assumed that the latent roots are 
arranged in the descending order of their moduli, and that a is so large 
that the contributions to the elements E due to the roots following Ap 
are negligible. Aitken gives a scheme based on this formula for succes¬ 
sive approximations to the values of A 1( A 1 A 2 , A 1 A a A a , etc. (see also 
§ 10 - 11 ). 

The formula (4) can be proved as follows. By equation (3-7-4) the 
expression for the general element is approximately 


Accordingly 


4=2 e fl Aj. 


4 = 

A* 

Af 

... AJ, 

X 

• 

.. Af _1 e 1 


AJ +1 

A| +1 

... Aj+i 


^2 ^-2 ^2 

.. Af-% 


Aj+k - 1 

AI+*- 1 

... AJ+*- 1 


Gp Ap Cp 

• • A?-^ep 



AJ + 1 

A | +1 

... A £ +1 

X 

c i 

Ai e i 

.. Af-ie, 


AJ+* 

A| +a 

... K +2 



A2 € 2 

•• A 


xi+p 

A| +J) 

... A* +J) 


«P 

ApCp . 

.. A | _1 Cp 


Hence 


4»+i/4» — A^ A a ... Ap. 


* Ref. 12. 
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When p = 2 and A x and A 2 are conjugate complexes // ± ico, the for¬ 
mula becomes 

A, A* = u 2 4- oj 2 = 

E a E S +1 
-®s I 1 E 8+2 

This can also be deduced from (4 15-6) and a consecutive equation 
obtained by substituting m 4 1 for m. Another useful formula, which 
is deducible by a similar method, is 

E s E S ¥2 
E s,l E s +3 

E s +1 E 8+2 

When A x and A 2 are conjugate complex quantities, this equation yields 
a value for 2p which agrees with that given by equation (4-15-6) and 
the consecutive equation. 

4*21. General Remarks on Iterative Methods. The latent 
roots of a matrix are normally calculated by expansion of the cha¬ 
racteristic function and solution of the resulting algebraic equation 
by the usual methods. However, when the order of the matrix is high, 
this process is excessively laborious, and the methods given in pre¬ 
ceding sections become advantageous, more especially when only a 
few of the roots with greatest moduli are required.* Repeated multi¬ 
plications by a matrix are readily performed with any ordinary 
calculating machine, and a machine could no doubt be devised to 
perform most of the necessary operations automatically. 

The rapidity of the convergence in the matrix method depends 
acutely on the separation of the moduli of the latent roots. This can 
obviously be increased by using some positive power of the matrix in 
place of the matrix itself, but this does not always result in a saving of 
labour. For example, when the original matrix is of a simple form, the 
simplicity will probably not be wholly preserved in its powers. The 
modal columns derived from a power of the matrix are the same as 
those appropriate to the matrix itself, and the latent roots are the 
corresponding powers of the original latent roots (see example (v), 

♦ The latent roots with least moduli can be obtained by use of tho reciprocal of the 
matrix in place of the matrix itself. 




K + i 

E 8+2 

E s +2 

E a +3 
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§ 3-6). The ambiguities which arise in extracting the true value A, of 
a latent root from a power may be resolved by substitution of the 
known elements of the corresponding modal column k a in one of the 
n scalar equations represented by (A a I—u)lc % = 0. 

When the chief object is to obtain the latent roots, and little interest 
attaches to the modal columns, it is preferable to use the process of 
repeated postmultiplication of a row matrix rather than repeated 
premultiplication of a column, for the first process leads to the evalua¬ 
tion of the matrix k required in the determination of the subdominant 
latent root (see §4*18). 

It may be well to point out that in general Aitken’s formula (4-20*4) 
will fail to give the values of more than a very few of the latent roots 
unless extremely high accuracy is preserved in the calculations. On 
the other hand, equation (4-20-4) is of considerable value when there 
is a set of latent roots with nearly equal moduli which are considerably 
larger than the moduli of the other roots. The values of a pair of such 
roots with nearly equal moduli can be obtained as the roots of a quad¬ 
ratic by means of equations (4-20-5) and (4-20-6). 


4*22. Situation of the Roots of an Algebraic Equation. For 

the sake of completeness a brief reference may be made to certain 
further methods of computation which, though not directly relevant 
to the subject of matrices, are of considerable value in investigations 
on stability. The first method is based on the process described by 
Routh* as cross-multiplication. This process is most simply defined as 
follows: Let a 0 ,a v a 2 , ...,a n and b Qi b v b 2 , ...,b n be any two sets of 
n-f 1 numbers, some of which may be zero. Then by cross-multiplica¬ 
tion we shall mean the derivation of a set of n numbers c 0 , c v c n _ 1 
such that 

[Co, C v ..., C n _J = & 2 » • • •» “ a o\bii b 2 ,..., b n ]. 

A convenient scheme of entry is represented by 
Bow No. 1 a 0 

2 b 0 

3 c 0 


&4 

b± ^2 ^3 ^4 

Cl C 2 Cj ... • • 


Any member of row 3 is here derived by cross-multiplication by 6 0 
and a 0 of the two numbers situated above and one step to the right. 


For instance, 


C 3 —- — ®o^4* 


* Seep. 226 of Ref. 13. 
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(a) Number and Situation of the Real Roots of an Equation (Sturm’s 
Functions). Storm’s theorem may be stated as follows :f Let f(x) = 0 
be the given algebraic equation, and denote by f x (x) the first derived 
function of /(*). Denote by f t (x),f 3 (x), ...,f n (x) the remainders, with 
their signs changed, whieh occur in the process of finding the highest 
oommon factor of /(*) and f x (x). Let N(x) be the number of changes 
of sign in the sequence f(x),f x (x), ...,f n (x): then the number of real 
roots of f(x) = 0 between x = a and x = b is N(a)—N(b). 

The presence of equal roots is indicated by the vanishing of one or 
more of the auxiliary functions. If/ r (z) is the last of these functions 
not to vanish identically, then the difference between the number of 
changes of sign when a and b are substituted in /,/ x , ...,/ r is equal to 
the number of real roots between a and b, each multiple root counting 
only once. 

A convenient scheme for the calculation of the coefficients of the 
auxiliary functions is as follows: Write 

f(x) = p 0 x n +p 1 x n ~ 1 + ... +p n ^x+p n , 
fi(x) = Po(l)* n_1 +l>i(l)*"~ 2 + — +2>„-i(l). 

/»(*) = Z>o( 2 ) *“-* +Pi( 2) x n ~ 3 +... + jp„- a (2), 
and so on. Enter the coefficients of f(x) and of f x (x) respectively as 
rows 1 and 2, and derive in succession by cross-multiplication row 3 
from rows 1 and 2, and row 4 from rows 2 and 3. Then the numbers in 
row 4 will differ by a constant positive factor from the coefficients of 
f t (x). The presence of this positive factor is immaterial, since in forming 
Sturm’s functions positive numerical multipliers common to any row 
can be introduced or suppressed at convenience. The process is then 
continued to give the coefficients of f z (x), by the adoption of the 
coefficients of f x (x) and f t (x) as new initial rows. A representative 
scheme of computation is set out below: 

Row No. 1 p 0 p x p t p 3 

2 *>o(l) 2>i(l) 2>s(l) 

*3 x x x x ... 

4 2>o(2) 2>i( 2) Pa(2) Ps(2) ... 

♦5 x x x x ... 

6 2>o(3) Pi(3) 3) p z ( 3) ... 

etc. etc. etc. 


t § M of Ref. 14. 
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STURM’S FUNCTIONS 

Each starred row is constructed by cross-multiplication of the two 
immediately preceding unstarred rows. Each unstarred row (sub¬ 
sequent to row 2) is formed by cross-multiplication of its two immediate 
predecessors. To reduce labour any convenient positive multiplier 
may be extracted from any row: the resulting row is then entered 
immediately below, and the original row is of course completely ignored. 
The process will be clear from the appended example. 

Example 

Assume the equation to bef 

f(x) = a; 4 -2a^ — 3a; 2 + 10a; —4 = 0, 
so that f x (x) = 4X 3 — 6a; 2 — 6a; -f 10, 

or, on suppression of the positive multiplier 2, 
f x (x) = 2a^ — 3a; 2 — 3a; + 5. 

The scheme is as follows: 


/ 

1 

-2 

-3 

>—• 
© 

1 

rf*- 

/l 

2 

-3 

-3 

6 

* 

-1 

-3 

16 

-8 

/. 

9 

-27 

11 


* 

27 

-49 

46 


/, 

-288 

-108 



/. 

-8 

-3 



* 

243 

-88 



/« 

-1433 





Hence Sturm’s functions may be taken as f(x) and f x (x) together with 
/ 2 — 9a; 2 —27a; + 11; / 3 = -8a;-3; / 4 = -1433. 

The signs are as tabulated below: 


X 

/(*) 

AC*) 

/.(*) 

/»(*) 

/.(*) 

N{x) 

oo 

+ 

+ 

+ 

- 

_ 

1 

1 

+ 

+ 

- 

~ 

- 

1 

0 

- 

+ 

+ 


- 

2 

-2 

- 

- 

+ 

+ 

- 

2 

-3 

+ 

- 

+ 

+ 

- 

3 

— oo 

+ 


+ 

+ 

1 ~ 

3 


There are thus two real roots, one situated between 0 and 1, and the 
other between — 2 and — 3. 


f Example selected from § 96 of Ref. 14. 
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TEST FUNCTIONS 


4*22 

(6) Number of Roots whose . Real Parts are Positive . the rules as 
given by Routh* in this connection may be stated as follows. Let 
the given equation be 

Po xn +Pl x1l ~ l + — +Pn-l x +Pn = 0 , 

and assume p 0 to be made positive. A sequence of test functions 
may now be derived as follows. Commence with R 0 = p Qi and derive 
from R 0 , J? 2 from R lf and so on, by writing the lower element for 
the upper element in the columns of the schedule 


Po 

Pi 

Pi 

Pi 

etc. 

Pi 


Pi 


etc. 


A cipher is to be written for any letter when the suffix exceeds the 
degree of the equation. For instance, in the case of a biquadratic, 
Vb> e tc. are zero, and the test functions as constructed on the 
foregoing basis are 

Rq = p Q \ R x = Pi, R 2 = P 2 


^3 = ^ 3 - 


PlP 4 



P1P2P3 P§p\ P\P\, 'O _ ATX 

-, II 4 — ^ 4 . 

PlP2-P0P3 


For an equation of general degree the number of roots with their 
real parts positive equals the number of changes of sign in the sequence 
of test functions R { , 

Routh’s test functions can be expressed conveniently in terms of 
the sequence of test determinants 


T 0 = Po\ T i = Pi> = 


Pi po ; 


Pa Pa 


Pi 

Po 

0 

; * 4 - 

Pi 

Po 

0 

0 

Pa 

Pa 

Pi 


Pa 

Pa 

Pi 

Po 

Pa 

Pa 

Pa 


Pa 

Pi 

Pa 

Pa 





Pi 

Po 

Pa 

Pi 


and so on. As before, a cipher is to be substituted for any coefficient 
with a suffix exceeding n. It can be shownf that Routh’s substitution, 


* §§ 297-301 of Ref. 13. 


t Ref. 15. 
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as given above, converts the determinant T r of the sequence into 
T r+1 /p v Hence 

^4 = T 0 ; iJj — T x \ -®j = 2$/^2> ©tc. 

The penultimate test function is 7 , n _ 1 /T n _ 2 , and the final one is TJT n _ x , 
which reduces to p n . An equivalent set, which offers the advantage 
that no member becomes infinite when its immediate predecessor 
vanishes, is 

p' _ m . p' _ m . p' _ m m . p' _ m m . 

■*•0 — -*0» -**1 — -*ii ^2 — i 2 i i> - 4*3 — ■* 3 2> •••> 

■®n-l — ^n-l^n-2* *®» = 

The necessary and sufficient condition for the real parts of all the 
roots to be negative is that all the test functions, or all the test deter¬ 
minants, shall be positive. This is the condition for stability of a linear 
dynamical system. 

It may be noted that the vanishing of T n _ x indicates either a pair of 
purely imaginary roots or else a pair of equal and opposite real roots. 

(c) Situation of the Latent Roots of a Matrix . The following theorems* 
relating to the situation of the latent roots of a matrix u with real or 
complex elements may be noted. The typical latent root is denoted as 
\==/i + io). 

(i) If u is Hermitian (see § 1*17), and in particular if u is real and 
symmetrical, the latent roots are all real. 

(ii) If u is general, fi lies between the greatest and least latent root 
of the Hermitian matrix \{u + u'), and co lies between the greatest and 
least latent root of the Hermitian matrix $i(u — u'). 

* For proofs and historical references, see Chap, vm of Ref. 2. 



CHAPTER V 


LINEAR ORDINARY DIFFERENTIAL EQUATIONS 
WITH CONSTANT COEFFICIENTS 

Part I. General Properties 

5*1. Systems of Simultaneous Differential Equations. An 

important field of application of the theory of matrices will now be 
considered, namely, the solution of systems of linear ordinary differen¬ 
tial equations. Such systems arise in many branches of mathematical 
physics, but the majority of the illustrative problems discussed in later 
Chapters will relate to dynamics. Hence it will be convenient to regard 
the independent variable as representing time, and to denote it by t. 

The most general system of linear ordinary differential equations 
with constant coefficients consists of a set of m equations connecting 
the m dependent variables x v x 2 ,...,x m with t. These equations may be 
expressed as 

Ui =/i i(D) x x +f 12(D) x 2 + ... +, fim(D) x m — Si (f) — 

—/ 21 (D) x x + / 22 (D) x 2 -h... +/ 2 m(D) x m — £ 2 (0 = 0, 


U m ^/ ml (D)x 1 +/ m2 (D)x 2 + ...+/ m JD)x m -^ = 0. 

Here D denotes the differential operator d/dt\ / {i (.D ) is a polynomial 
of D having constant coefficients; while £ 1 (^), etc. are given functions 
of t. The set of equations is expressible as the single matrix equation 


0, . ( 1 ) 

or, when no ambiguity can result, even more concisely as 

U = /*-£ = 0. .(2) 

It may be referred to simply as the system/. 


If the elements of the matrix /(D) are polynomials of degree N at 
most in Z>, the system is said to be of order N. It is sometimes con¬ 
venient to express equations (2) in the alternative form 

(AoDN+A^-i + .'.+A^D+A^x = £.(3) 

Each of the coefficients A 0 , A v etc. is a square matrix of order m 
having constant elements. In particular cases one or more of these 
elements may be zero, but one element at least of A 0 must not be zero. 
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SYSTEMS OF EQUATIONS 

The system /is said to be homogeneous when the functions £$) are 
all absent: if one or more of these functions is present, the system is 
non-homogeneaw. The general solution of a non-homogeneous system 
is the sum of the complementary function , namely, the general solution 
of the homogeneous system obtained by omission of all the functions 
£i{t)j and of any particular integral of the complete system. 

The theory is intimately bound up with the properties of the matrix 
/(D), which are obviously closely analogous to those of A-matrices (see 
Chapter m). The usual process of construction of the complementary 
function practically amounts to a substitution of A for the operator D 
in /(D). The method, which is well known, is to assume a constituent of 
the complementary function to be x = e A %, where e M is a scalar multi¬ 
plier, and k is a column of constants to be found. Then A and k must 
be such that /(D) e “k = e u f(. A) k = 0. 

The condition of consistency requires the index A to be a root of the 
algebraic equation A(A) _ | ^ (A) | = 0> . (4) 

while the column k r of constants corresponding to any given root Ay 
is determined to an arbitrary common multiplier by /(Ay) k r = 0. This 
process yields the complete complementary function in the standard 
simple case where all the roots of A(A) = 0 are distinct. 

Both the purely operational determinant | /(D) | and the corre¬ 
sponding algebraic determinant | /(A) | are sometimes referred to as 
the “characteristic determinant”. Although in most cases this can 
cause no confusion, it is preferable to adopt terms which distinguish 
between operators and algebraic quantities. It is also common practice 
to speak of equation (4)—sometimes also written in the quite meaning¬ 
less form | /(D) | = 0—as the “characteristic equation”. However, this 
term already has an established meaning in relation to A-matrices of the 
special simple type A/ — u. The following definitions will be adopted 


/(D) . the D-matrix. 

/(A) . the A-matrix. 

A(D) = | /(D) | ... the D-determinant. 

A(A) == | /(A) | ... the A-determinant. 

A(A) = 0 . the determinantal equation. 


The degree in A of A(A) is assumed to be n, and the roots of the deter¬ 
minantal equation are denoted by A x , A 2 ,..., A n . A distinct symbol is 
therefore used to represent each root, even when multiple roots occur. 
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5*2. Equivalent Systems. From a given system/ of linear differ¬ 
ential equations it is possible to form new systems by operations with 
polynomials of D on one or more of the equations U v U 2 , etc., and addi¬ 
tion. New complete systems g may thus be derived which will have 
the form 

gy-v = M D )(fy-£) = o. .(i) 


The square matrix /i(D) here applied as a premultiplier to the system 
/ is of order m, and its elements are given polynomials of D. The deter¬ 
minant | /i(D) | will be assumed not to vanish, but it may be constant 
in the sense that it is independent of D. 

In (1) the dependent variables are denoted by y whereas in (5-1-2) 
they are denoted by x. This distinction must usually be made since the 
derived system g normally contains greater generality than the parent 
system /. Clearly, every solution of / satisfies g, but every solution of g 
does not necessarily satisfy /. However, the important case is where 
both conditions are satisfied, so that every solution of/satisfies g , and 
conversely. The two systems are then equivalent , and the variables x 
and y can legitimately be identified. The condition for equivalence is 
that the determinant | /i(D) | shall be independent of D and not zero. 
In this particular case the elements of the reciprocal matrix /4 _1 (D) 
are clearly also all polynomials of D; and since /?/ — £ = /r^JD) (gy — rj), 
it follows that every solution of g then satisfies /, as required. The 
condition | fi(D) | = const, is necessary and sufficient for the equi¬ 
valence of the two systems. It is evident that the D-determinants of 
equivalent systems can only differ by constant multipliers. 


Example 

Suppose /to be the system 

(D*+l) Xl + 2D*x 2 = 0, | 

Dx x -f (2D 2 —l)x 2 = cos t j 

Then if /i(D) is chosen to be T 1, — D 1, the derived system g is 

[~A£ 2 +iJ 

r i, - d lp’+i, 2Z> 3 ir 2 / 1 ] = r i, -^lr o i, 
L-2>,2> 2 +iJL A 2Z> 2 — 1 J |y 2 J [-D, Z> a +lJ |_cos t\ 

or y 1 + Dy< t = aintA 

(Z>a-l)y, = 0. J 


(3) 






5*2-5*3 EQUIVALENT SYSTEMS 159 

Since | fi{D) | = 1 (anon-zero constant), the two systems are equivalent. 
The solution of (3) is here obvious by inspection. Evidently 

y 2 = + 

and hence y x = sin t — c^er 4 , 

where c v c 2 are arbitrary constants. Thus the solution of (2) is also 
x x = sinJ-c^ + Cge-*, 

X 2 = CiC^ + CgC"^ 

The important point to note is that equations (2) can be derived from 
(3) purely by operations with polynomials of D , just as (3) were derived 
from (2) by such operations. To obtain/ from g the premultiplier to be 
used would be 

rHD)m r i, -£]-* = p*+i, A]. 

L~Ai> 2 +iJ [ A lj 

When | /i(D) | is not a constant, then although g can be derived from/ 
by steps involving only differentiations, yet / cannot be constructed 
from g without operations of integration and the attendant introduc¬ 
tion of further arbitrary constants. 

5-3. Transformation of the Dependent Variables. In the 

preceding section attention was restricted to the use of matrices 
fi(D) as premultipliers to the equations. However, A-matrices are 
applied as postmultipliers when a direct transformation of the depen¬ 
dent variables a; to a new set, say X f is effected such that 


x = /i(D)X. (1) 

The following systems are then evidently the same: 

— 0 , .( 2 ) 

f(D)p(D)X-£ = 0. (3) 


If the general solution of (3) in the variables X can be found, then the 
corresponding values of x are given uniquely by (1), even when 
| n(D) | is not a constant. On the other hand, if the solution of (2) in 
the variables x is known, the most general values of X will not 
usually be deducible from (1) without operations of integration and 
the introduction of further arbitrary constants. The exceptional case, 
again, is when the determinant | fi(D) \ is a constant other than zero: 
the values of X are then determinable uniquely by means of the inverse 
relation X — n~ l {D)x. 
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TRIANGULAR SYSTEMS 


5 * 3 - 5*4 


Example 

If in (6*2*2) we substitute 

Gratia- 

the system is converted into 

P a +1, 2D* ] [ 2D, 2D 2 -1] TX{\ = [ 0 ], 
[ D, 2D 2 — 1J L —1» -D JUJ [cosIJ 

or 2 DX x + (D 2 -1) X a = 0, 

X x = cos t. 

Hence X a = —sin t+c 1 e l +c a e- t , so that 

fa] = I"2D, 2D 2 —11 T coat 1 

UJ [-!> -D J [-sin*+c 1 e‘+c,e- < J 

= [sinJ+c^+Cge-n. 

L -c^+^e-* J 


This solution is effectively the same as (6*2*4). Conversely, if the values 
of x just obtained are supposed known, then the corresponding values 
X would be given uniquely by 

pvi = [2D, 2D 2 -11-1 r x 1 = r D> 2 D 2 -1] fa . 

UJ L-i, -D J UJ L-i. - 2 D JUJ 


5*4. Triangular Systems and a Fundamental Theorem. 

A system, say hx —§ = 0, is commonly called a “diagonal” system 
when h(D) has the special triangular arrangement 




'Km \m ... 

0 Ajj(D) ... 


Km(D) • 

^2m(-^) 


L 0 ... 0 A mm (D)J 

It is preferable to describe such a system as triangvlor, since with 
matrices the term “ diagonal ” implies that only the principal diagonal 
elements are present. 

The first equation of the system here involves x x and (in general) all 
the remaining variables, the second equation involves all the variables 
with the exception of x lt and so on, until finally the last equation 
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involves x m only. In this case the variables are said to be in the 
diagonal order x v x 2 , ... ,x m . 

It can be shown* that any system of linear differential equations 
with constant coefficients is reducible to an equivalent triangular 
system in which the dependent variables have any assigned diagonal 
order (e.g. that adopted above). The number of arbitrary constants 
involved in the solution of any triangular system is clearly the sum of 
the degrees in D of the diagonal coefficients, and it therefore equals 
the degree in D of the product h n h 2 2 ... h mm . This product, however, 
is equal to the Z)-determinant of the triangular system, and is thus 
merely a constant multiple of the D-determinant of any equivalent 
system. The fundamental theorem follows, that the number of arbitrary 
constants entering into the solution of any linear system with constant 
coefficients equals the degree in D of the determinant A(Z>) = | f(D) |. 

Examples 

(i) A Given System Expressed as an Equivalent Triangular System. 
Suppose the system / to be 

(Z>+ l)x 1 + D 2 x 2 + (D+l)x z = 0/ 

(D-1)x 1 + Dx 2 + (D-1)x 3 = 0, • .(1) 

x 1 +x 2 +Dx 2 = 0.^ 

Use as premultiplier 

m ( d ) = r—1, -A & 

0, -1, (D-l) 

_ 1, ( —Z) 2 -f-D-f 1), (D Z — 2D 2 — D) 

the constant determinant of which is — 1. Then the result is 

-1,-Z) 2 , (Z)3-Z)2_1) 1^1=0, .(2) 

0, -1, (D 2 — 2D+1) x 2 

0, 0, (D 4 - 3D 3 + D 2 + D) J lx z _ 

or — x x — D 2 x 2 + (D* — D 2 —1)x 3 = 0, 

—x 2 + (D 2 — 2D+l)x 2 = 0, 

(D*-$B z + D 2 +D)x z = 0. 

* A formal proof is given in § 6*5 of Ref. 5. 
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CONVERSION TO FIRST ORDER 


54 - 5-5 


Triangular systems of the type (2), in which all the principal diagonal 
coefficients with the exception of the last are independent of D , are of 
special simplicity, and may be referred to as simple triangular systems. 

(ii) The Fundamental Theorem . The number of arbitrary constants 
entering into the general solution of (2) is evidently four (the degree of 
the last coefficient). Hence four arbitrary constants appear in the 
general solution of (1). This equals the degree in D of | f(D) |. 

5-5. Conversion of a System of General Order into a First- 
Order System. If the system fx - £ = 0 is not already of first order, 
suppose it to be expressed in the form (see (5*1*3)) 

(A 0 D N +A 1 D N ~ 1 + ... + A N _ l D + A N )x = £, ...,..(1) 

in which the coefficients A are square matrices of order m with constant 
elements, and N > 1. One element at least of A 0 is assumed not to 
be zero. 

Case I. Leading Coefficient Not Singular. Suppose firstly that 
| Aq | 4= 0, so that the reciprocal matrix Aq 1 exists. In this case, 
equation (1) can be premultiplied by A q 1 to give 

L^x = -’a N x — a N _ 1 Dx-...—a x D N - 1 x + Z„ .(2) 

where a { = A^A^ 

and £ = 

Now write X x = x, X 2 = Dx, ..., X N = D N ~ x x, 

so that DX x ~X 2 , DX 2 = X 3) DX N „ x = X N , .(3) 

while equation (2) becomes 

DX n = — a NX x — a N _ x X 2 —... — a x X N +£ .(4) 

If the N sets X v X 2 ,..., X m —each containing m variables—are 
regarded as a single set of mN new variables y defined by 

y = {XJ = {x xy ...,a m , Dx v ... y Dx m , 

the mN scalar equations satisfied by the variables y are represented 
by (3) and (4). This first-order system will be denoted by 

Dy = uy + rj , 

? = •••> 0 , £}, 


where 
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and where u is a square matrix of order mN which can be partitioned 
as below into square matrices of order m; 

u = r 0 , / m , 0 , 0 , 0 

0, 0, 4> 0, 0 

0, o, o, .... o, 4 

- ^jV> -~ a N- 1> “•••> ~ a i 

A system of differential equations of general order can usually be 
converted into a first-order system in an indefinitely large number of 
different ways. The special method given above has the advantage of 
simplicity. 

Case II. Leading Coefficient Singular. Next, let A 0 be of rank r. 
Then (see §3*13) non-singular square matrices P, Q , of order m, with 
constant elements, can be found such that 

oc 0 = P A 0 Q, 

where a 0 has zero elements with the exception of r units occupying the 
first r places in the principal diagonal. Now replace the m variables x 
in equation (1) by a set a? such that x = Qx , and premultiply the 
resulting equation by P. Then 

(a 0 D”+PA l QD"-' + ... + PA N Q)x = P£, 

or say oc 0 D N x = — a N x — a N _ 1 Dx — ...—a 1 D N - l x- .(5) 

The system of scalar equations represented by (5) resembles (2) except 
that the left-hand side of the ith equation is equal to D N X i when i < r, 
but is zero when i > r. Hence if a new set of dependent variables y is 
introduced such that 

{*/} = {*i-••••*«. D N ~ l x v ...,D N ~'x m }, 

the equations (5) w ill be reducible to a system of mN equations of first 
order similar to (3) and (4), except that the last m —r equations will 
consist purely of linear algebraic relations connecting the mN variables 
y. These m-r linear relations can be used to eliminate ra — r of the 
variables y from the remaining equations of the system. The final 
system will then consist of only mN — m -f-r independent equations. 

In practice, when A 0 is singular the conversion to a first-order 
system can usually be effected by a judicious manipulation of the 
equations, without actual determination of the matrices P and Q. 
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CONVERSION TO FIRST ORDER 


5-5 


Examples 

(i) A Single Equation of Order n. Let the given single equation be 
D n x+p 1 D n ~ 1 x +... +p n x — £. 

Then if {y lt y t ., ...,y n }={x,Dx, 

the equivalent system of first order is 



(ii) Linear Dynamical Equations of Lagrangian Type. The Lagrangian 
equations for the small free motions of a dynamical system in m 
generalised coordinates q v q t ,...,q m are of the type 

AQ + Bq + Cq — 0, 

where | A | + 0. They can be replaced by the system of 2m first-order 
equations represented by Dy — U y 

where y = {q, 

and u = r 0, -4, ] • 

l-A-'C, —A~ 1 B\ 

(iii) The Matrix A 0 Singular. Consider the system 

'3 2 11/)**+TO 0 11 Dx+ T1 2 l]a: = 0. 

21-1 010 0-12 
.1 1 2 J 0 Oj L 2 1 0. 

Here A 0 is of rank 2. Subtraction of the sum of the second and third 
equations from the first yields 

— 0 , ...... ( 0 ) 

which can be used to express D*x z in terms of D*x x and D*x z and lower 
derivatives. By this means, the first two equations may be replaced by 

- DXi + ZDx z - Dx t - x a + 2x t - 0,1 

aDVsj+SB*!—14D* g +6B*,+* 1 +6® 1 —7a^= OJ 
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Hence, if {y v y a , y a , y x , y a , y a } = {x v x a , x a , Dx v Dx a , Dx a } .(8) 

then Dy 1 = y x , 

= Vi> 

and from (7) and (8) 

By* = 2/4-%5 + 2/e+2/2- 2 ys. 

tys - - f vt+H-y* - 2 y« - hi - 2 v*+iv* 

Also (6) is equivalent to 

0 = 2/4+y 6 -2/6 + 2/i-“ 2 2/2 + 2/3* 

These equations can be used as they stand, or reduced to a system of 
five equations by elimination of one variable, say y 6 . The final system 
would then be 
D 


~y{ 

= 

" 0 

0 

0 

1 

0" 


>r 

V2 


0 

0 

0 

0 

1 



y 3 


1 

-2 

1 

1 

1 


y» 

Vi 


1 

-1 

-1 

2 

-2 



-Vs- 



2 

1 

J 


8 


-^ 5 - 


5-6. The Adjoint and Derived Adjoint Matrices. For con¬ 
venience of reference a number of important definitions and theorems 
relating to A-matrices will now be recalled. 


If 


/(A) = 


/uW> /12(A), 
AlW* f 22(A), 


flmW 9 

Ami A) 


the adjoint of/is defined to be 




^ii(A), F 21 (A),..., F n ,i(A) 
^12-^22•••> tfnzW 


where Fy is the cofactor of f if in the determinant A(A) = |/(A) |. Thus 
F is the transposed of the matrix of the cofactors of /. The two matrices 
/, F have the property 

/(A)F(A) — F(A)/(A) = A(A)/ m . 
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ADJOINT MATRICES 


5*6 


The pth derived adjoint matrix is denoted by F(A), and for 

, where A, is any root of A(A) = 0, is 

A-A r 

represented by F(A r ). 

Theorems (A) to (E) of § 3-6 may be summarised as follows: 

(а) Simple Roots. If A, is a simple root ofthedeterminantal equation, 
/(A,) is simply degenerate and F(A r ) is expressible as a matrix product 

^(A r ) = {fc< r }L* <r Jsfc r /c r , 

in which k ir , K ir are constants appropriate to the chosen root and at 
least one constant of each type is not zero. 

(б) Multiple Roots . If X 8 is a multiple root and if f(X 8 ) is simply 
degenerate, then the adjoint can be represented as for case (a) by 
F(X 8 )==Jc 8 k 8 . On the other hand, if f(X 8 ) has degeneracy g> 1, the 

{Q- 2 ) 

adjoint and the derived adjoints up to and including F(X 8 ) at least are 
all null. The root has multiplicity q at least when/(A s ) has degeneracy q. 
The columns k r and the rows K r have the properties (compare (3-6-5), 


(3 ' 6 ' 6)) /(A.) K = 0. .(1) 

*,M-) = 0. .(2) 


v . , . ld*F\ 

brevity the matrix 


Examples 

(i) Simple Roots . Assume 

/(A) = |"A 4* 2, A 8 + 2A 2 + 21. 

Then U A ‘ +2 J 

A(A) = -2(A-2)(A+l) and F(A) = rA 4 + 2, -A 3 -2A*-21. 

L — A 8 , A+2 J 

The roots, say A x = 2 and A a = — 1, are here both simple. They give 

18 -18J-J 9j [2, — 2] 

and ^(A,) - J 3 -3j = j 4 3J [1, -1]. 

(ii) Multiple Roots. Suppose 

/(A) = rilA 8 + 14A+15,8A 8 +14A+14,6A 8 +11A+10' . 
11A 8 +10A+ 11,8A 8 + 10A+ 10, 6A*+8A+7 
7A 8 +2A + 7, 6A* + 3A + 6, 3A 8 +3A+4 _ 
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In this case A(A) = A(A- 1) 2 (A +1) 2 , giving = 0, A 2 = A 3 = 1, and 
A 4 = A 5 = -1. Substitution of the roots in /(A) gives 


/(*») = 

15 14 10" 

11 10 7 

; /(A 3 ) = 

"40 36 26" 

32 28 20 

; Ms) = 

“12 8 4“ 

12 8 4 


. 7 6 4. 


.16 14 10. 


.12 8 4 


The first two of these are simply degenerate, while/(A 6 ) has degeneracy 
2. The corresponding adjoints are 




W = 


-2 4 

5-10 . 

-4 8 -- 

0 4 -8' 

0-16 32 
0 16-32 


-2 

5 

-4. 

4 

-16 

16 


[ 1 ,- 2 , 1 ], 


[ 0 , 1 , - 2 ], 


^(A 6 ) = 0. 


5-7. Construction of the Constituent Solutions. It will now 
be shown how the constituent solutions of a system of homogeneous 
differential equations f(D) x = 0 can be obtained. For clarity the con¬ 
stituents appropriate to simple roots and to multiple roots of the deter- 
minantal equation are considered under separate headings. 

(a) Simple Roots. Let A,. denote any simple root of A(A) = 0. Then 

8ince f(D) C V FiK) = e^f(K) F(K) = 0, 

it follows that every column of the matrix e x ** F^) satisfies the given 
differential equations. But by § 5-6 the columns of F(A r ) are all pro¬ 
portional to k r . Hence the constituent solution corresponding to the 
given simple root can be chosen to be an arbitrary multiple of e x ^k r . 

(b) Multiple Roots. Let A g be a multiple root, and for simplicity 
suppose, firstly, A a to be only once repeated. Consider the two matrices 

W Q (t,A 8 ) = e UF( A g ), 

Fi(<,A,) S [A e ^(A)]^ = e^[F(X,) + tF(X,)]. 

The first matrix W 0 (t, X,) is similar to that already considered and all 
its columns thus satisfy f(D) x = 0: this conclusion remains true, but 
is nugatory, when F(X a ) happens to be null. We shall now show that 
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all the columns of W x (t, A,) also satisfy the differential equations. By 
the usual properties of differential operators 

= eA, (4+«)A ( A ) /. .(1) 

Since X a is assumed to be a double root of A(A) = 0, both A(A) and 
vanish when A = A,. Hence 

f(D)W x {t,X a ) = 0. 

It can be proved by exactly the same method that, if A, is an a-fold 
root of A(A) = 0, then every column of every member of the family of 
matrices W^t, A,), W x (t, X s ),..., W a _ x (t, X a ) satisfies the differential equa¬ 
tions, where the typical matrix of the family is defined to be* 

It will be convenient to write W p (t, X a ) = e x J U p (t, X a ), so that 
(P) (P- 1) m(m —11 (J>~2) 

U p (t,X a ) = F(X a )+ptF(X a ) +^2 —*FW +...+( p F(X a ). 

Since there are a matrices U 0 , U x ,..., U a _ x corresponding to the s-fold 
root, and since each matrix has m columns, the total number of con¬ 
stituent solutions (including possibly some null solutions) obtained in 
this way would be me. However, corresponding to any s-fold root, only 
8 linearly independent solutions are to be expected. It follows that the 
me columns concerned are necessarily connected by «(m—1) linear 
relations. 

In view of the theorems summarised under heading (6) of § 5-6 it is 
seen that when f{X a ) has degeneracy q the matrices W p (t, A s ) and 
U p (t, X a ) are null for values of p up to and including q— 2 at least. 

* Mo™ generally, if W „( t , \) = e A <*" T > ^ , 

where r is an arbitrary constant, then every member of the family W 0 , W lt ...» W^i also 
satisfies f(D) z = 0. 
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Examples 

(i) Simple Case of a Diagonal System. Suppose /(D) *=0, where 
/(A) = RA-a), 0, 0 

0, (A—a) (A—/?), 0 

. 0, 0, (A—a) (A—/?) (A - y)_ 

= (A-a)n, 0, ° ], 

0,(A-/?), 0 

.0, 0, (A—/?) (A—y) 

and a+fi+y. Here A(A) = (A - a) 3 (A—/?) 2 (A — y), and 

F(A) = (A-a)MA-/?)RA-/?)(A-y), 0, 0‘. 

0, (A-y),0 

0, 0, 1. 

For the simple root A = y the matrix W 0 {t, y) — e 7t F(y) contains a 
non-zero column proportional to {0,0,1}. The one constituent corre¬ 
sponding to y may thus be taken as e^{0,0,1}. 

The degeneracy of/(A) is 2 for the root /? and 3 for the root a, and for 
each of these roots F( A) is null. Now 
(i) 

F(/3) = (/3-a)* R), 0, 01, 

o,(/?-y),o 
0, 0, 1. 

and this contains two non-zero columns, which lead to the two distinct 
constituents e^{0,1,0} and 0,1}. 

(i) 

Again, F(a) = 0, but 
( 8 ) 

F(a) = 2(oc—/?)Ra—/?)(«—y), 0, 01, 

0, (a-y),0 

0, 0, 1. 

which yields the three independent constituents e“*{l, 0,0}, e^O, 1,0} 
and e^O, 0,1}. 

Next consider the modifications when y = /?. In this case 
/(A) = (A-a)p, 0, Ol, 

0, (A-yff), 0 
0, 0, (A-/?)*. 



170 


CONSTITUENT SOLUTIONS 


5-7 


and A(A) = (A—a) 3 (A —ft) 3 , which yields two triple roots a and 0. 
As before, the matrix /(A) has degeneracy 2 for the root 0 and 3 for 
the root a, while F( A) is null for each root. The three constituents 
corresponding to A = a have the same forms as before. However, 


(i) 

F(0) = (0-a)* 


0 0 01 , 

0 0 0 

0 0 1 


which yields only one non-vanishing column. To obtain the full 
number of constituents corresponding to the triple root 0 it is necessary 
to proceed to (2 ) 


F(fi) = 2(0-a) 


0 0 

0 (0-a) 

0 0 


O' 

0 

2 


The three independent constituents required are now given by the 

<u 

third column of the matrix efi l F(0) and the third and second columns of 
(2) (i) 

e^[F(0) + 2tF(0)]. They may be taken as e^{0,0,1}, e^{0,0, <} and 
e*{0,l,0}. 

Lastly, suppose a = 0 — y, so that 


m - (A -«) 


1, 0, o • 

0, A — a, 0 
0, 0, (A-a) 3 . 


with A(A) = (A—a) 3 . Here a is a sextuple root, and/(a) has degeneracy 
3. Then 


i*(A) = (A-a ) 3 


(A-a)*, 0, 0 
0, A—a,0 

. 0, 0, 1J 


The derived adjoint matrices which are not null are 


(8) 

0 0 O' 

<4) 

5 F(a) - 

O 

o 

o 

(«) 

; F( a) = 

120 0 0" 


0 0 0 


0 24 0 


0 0 0 


0 0 6 


0 0 0. 


0 0 0_ 


while the matrices whose non-vanishing columns yield possible con* 
stituents are 

<s> , (4) (3) (6) (4) (») 

e a> F(cc), e' t [F(a) + 4tF(a)], and e*lF(a) + 5tF(a) + 10PF(ct)]. 
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The six independent constituents may clearly be chosen as the six 
columns of the matrix 


cat 


0 0 
1 0 
0 t 


1 

0 

0 


(ii) A more General System. Suppose 


m = 


A®—A® — 2A, A® — A 2 + A, 


A® — 3A, A®, 

A® + 2A® — A, -A®, 


-A ' 
-A 

A®+A®—A 


Here A(A) = — 2A®(A—1)®(A +1), giving the roots A = 0 (triple), 
A = 1 (double), A = — 1 (single). In this case 


F(\) = A*(A-1) 


A(A +1) 2 , -(A® + A®+1), 1'. 

-(A 2 — l)(A + 2), (A® + A 2 —2A— 1), — 1 
— A(A+1) (A +3), (A® + 3A 2 +1), —3_ 


This is null for A = 0 and A = 1, and for A = — 1 it reduces to 


JP( —1) = —2 


ro -l 


n. 


0 1 -1 
0 3-3 


The constituent corresponding to the single root A = — 1 may thus 
be taken as c^l, — 1, —3}. 

Since F( A) is null for the double root A = 1, the two constituents 
corresponding to this root will be given by two of the columns of 


e?F(l). By direct differentiation of the expression for F(\), it is seen 
that (i) 


*( 1 )- 


4 

0 

-8 


-3 n. 


-l 

5 


-1 

-3 


The three columns of this matrix are not linearly independent: the 
sum of the first two yields the third. 

(i) 

Lastly, it is evident that F( 0) = F( 0) = 0, and that 
( 2 ) 


F( 0) = -2 


o -i n. 
2 -1 -1 
0 1 -3 
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The three independent constituents corresponding to the triple 
root A = 0 may therefore be taken proportional to the three columns 
of this matrix. 

It should be noted that the arithmetical work involved in the 
evaluation of the derived adjoint matrices for the multiple roots is 
often greatly simplified by the. extraction, as a scalar multiplier, of any 
factors common to the elements of jF(A) (e.g. the extraction of A 2 (A — 1) 
as a common factor in this example). 


5*8. Numerical Evaluation of the Constituent Solutions. 

As explained in § 5*7 the constituent solutions appropriate to any given 
root A 8 of A(A) = 0 are found from the independent columns of the 
family of matrices W p (t, A 8 ), which are linear in F( A 8 ) and its derivatives. 
A possible method of computation of the constituents is to construct 
F(A) from the cofactors in /(A), and to obtain the necessary derived 
adjoint matrices by direct differentiation with A kept general. The 
value A = \ 8 is then inserted. However, in practice, the construction 
of F(A) for a general value of A is extremely laborious when the order 
of/(A) is even moderately large. A simpler treatment, which is actually 
an extension of the methods given in §4*12, will now be described. 

When A s is a simple root, the corresponding constituent can be 
taken proportional to any non-zero column of the matrix W 0 (t, A a ). 
A column proportional to the columns of F(A 8 ) can be calculated at 
once by the method of § 4-12. 

When A 8 is a double root, the constituent solutions are to be chosen 
proportional to any two linearly independent columns of W 0 and W v 
More generally, the required columns can be chosen from the matrices 
aW 0 a 

(i) 

and aW x +bW 0 = e A * , [oF’(A g ) + bF(\) + toF(A g )], 

in which a and b are arbitrary scalars. But since /(A a ) F( A g ) = 0 and 

0 

g^-/(A,) F{\g) = 0 when A, is a double root, we have also 

/(A,) [a-F(A g )] = 0, 


(X) (1) 

/(A.) [aF(A 8 ) + bF( A,)] +/(A.) [«F(A S )] = 0. 

Hence, if <j> denotes a column of aF(X e ), and <j> x the corresponding 
(i) 

column of aF( A.) + bF(. A,), y (A>) ^ = 0> . (1) 

/(A.)&+/?A,)0 = O. .(2) 
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Equation (1) when treated by the method of § 4-12 yields <f >, and if the 
column so determined is substituted in (2), <f> t can be found by 

(i) 

operation on the rows of /(A s ) and f(X 8 ) <f> in a way similar to that 
described in §4*12. If f(\ 8 ) is doubly degenerate F(A 8 ) is null, and the 
equation to be solved then is 

/(W 1 = 0. 

This yields two linearly independent solutions. 

When A a is an 5-fold root, an extension of the foregoing process 
yields successive columns which may be used directly to construct the 
constituent solutions. 


Examples 


(i) First-Order System . As a very simple first example, suppose 
f(D)x=[D+ 1 , —41 pTjl = 0. 

L 1, D- 3JU 2 J 

The determinantal equation A(A) = 0 has a double root A = 1, and 


/(l) — |2, — 4J, /(1) = 


The method of §4-12 shows at once that the columns of the adjoint 
F(l) are proportional to {2,1}, and we may therefore choose <f> = {2,1}. 

““ C:r + PI 


= 0 . 


Since/(l) is singular, the equations contained here are not linearly 
independent (see §4*13). Writing = (a, /?}, we obtain the single 
equation a — 2fi +1 = 0, so that 


^ x = {2/?-l,/?> = {2, !}/?—{!, 0}. 


The column multiplied by fi is an arbitrary multiple of (j> and can be 
disregarded. Thus <p x can be taken to be {1,0}, and the constituents 
can be chosen as e l {2, 1} and ef{l, 0} + te l { 2,1}. (1) 

The first columns of the actual adjoint F( 1) and of F( 1) are { — 2, — 1} 
and {1,0}. 


(ii) Second-Order System. Next consider the system for which 


/(A) = 


[2A 2 -4A + 1, A*-A+1, 2A-H. 


A 2 - 3A+1, A 2 —2A+1, 2A—1 
A 2 —6A+3, 2A 2 —6A+3,5A—3 
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Here A(A) = A 2 (A — l) 8 , and/(A) is doubly degenerate for the double 
root A = 0 and simply degenerate for the triple root A = 1. 

The method of §4*12 at once yields the two constituent solutions 
corresponding to A = 0. The rows of the transposed of/(0) are combined 
as follows: 

r i 

fj+r, 

r »+ r a 


1 1 3 

1 1 3 

-1 -1 -3 

1 0 0 

0 1 0 

0 0 1 

0 0 0 

0 0 0 

1 0 1 

0 1 1 


Hence the two constituents may be chosen as {1,0,1} and {0,1,1}. 
For the triple root A =* 1 we have 

( 1 ) ( 2 ) 

/(!) = - 


-i i r 

II 

i 

i - 

i-H 

O 

i_ 

, /(i) = 

‘4 2 0" 

-1 0 1 


-10 2 


2 2 0 

_2 -1 2. 


l 

lO 

1 

1 


2 4 0 


A column of F(l) is found by the previous method to be <f> = {1,0,1}. 
The equation determining (j> x is 
(i) 

/(!) <f>i +/(1) <t> =/(l)0i + {2,1,1} = 0; or, if <j> x ={a, fi, y}. 


‘-I 11' 


'a 

= 

‘-2‘ 

-1 0 1 




-1 

_2 -1 2 


.7. 


-1 


If we reject the last scalar equation represented here, and regard y as 
arbitrary, we can write 



the solution of which is readily found to be 



Hence fa s {a, yff, y} = y{l, 0, !}-+-{!, — 1, 0}. 


Here, again, an arbitrary multiple of <f> is present. 

To determine the next independent column 0 a ={£,?/,£} we can 
assign y at convenience; but for illustrative purposes the value will be 
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kept general. The equation for 0 2 is then 

( 1 ) ( 2 ) 

/(l)<4 2 + 2/(l)<4 1+ /(l)<S = 0, 


or 

-1 

1 

r 


T 

= 7 

-4" 

+ 

2" 

+ 

“-4 " 

= y 

-4" 

+ 

" — 2" 


-1 

0 

i 


V 


-2 


2 


-2 


— 2 


0 


— 2 

-1 

2_ 


X 


— 2 


4 


-2 


-2 


2_ 


Rejecting the last scalar equation, and keeping £ general, we write 



which gives J7-J ■un-ai-a- 

or Q = £{1,0, l} + 2y{l, -1, 0} + {0, -2, 0}. 

In this solution arbitrary multiples of <f) and <}) 1 are present, but it is 
legitimate to choose y = f = 0, and then 0 2 = {0, — 2, 0}. Hence the 
constituent solutions may be chosen to be 

*{1,0,1}, *{1, — 1, 0} + fe*{l,0,1}, 

and *{0, — 2, 0} + 2te*{l, -1, 0} + *V{l,0,1}. 

Actually the first column of JP(1) is {1,0,1}; while the first columns 
( 1 ) ( 2 ) 

of jP( 1) and F( 1) are, respectively, 

{ 4 ,- 1 , 3 } = 3 { 1 , 0 , 1 } + { 1 ,- 1 , 0 }, 
and {8, — 8, 2} = 2{1,0,1} + 6{1, -1, 0} + {0, -2, 0}. 


5*9. Expansions in Partial Fractions. We shall next consider 
certain identities which will be used extensively later. 

Suppose 0( A) to denote an arbitrary (q, m) A-matrix, and let A(A), 
jF(A) be, respectively, the A-determinant and the adjoint matrix of/(A). 
Then each element of the matrix product 6?(A)-F(A)/A(A) can be 
developed in partial fractions by the usual methods of algebra, and 
a similar type of expansion can therefore be assumed for the complete 
matrix product. The precise form of the expansion depends on the 
nature of the roots of A(A) = 0. 


Case I. Roots of A(A) = 0 all Distinct . In this case the correct form 
to assume is 

.(!) 


= Q(X)+ 2 At 


A(A) 


rtiA-V 
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Here 0(A), if present, is a A-matrix, while A v A a , etc. are matrices of 
constants which are determinable by the usual rules. Thus to find A r , 
multiply (1) throughout by A—A, and put A = A,. This gives 


Hence 


A r = ^(A,) J , (A r )/A(A r ). 

a< 2x) W - - 0 ( a ) + £ . 

4(A) "'klviX-K) 


If the product G( A) F(X) is of degree less than n (the degree of A(A)), 
then Q( A) = 0. 

As a particular case assume 0( A) = / m . Then (2) gives 

Sy =/~ 1 (A) = 0(A) + i Tl) J,(Ar) - -.(3) 

A(A) , ” 1 ^(A r )(A-A r ) 

When the degree of F(A) does not exceed that of A(A), the quotient in 
(3) will be a matrix of constants (or ciphers), say Q = G. To determine 
C put A = 0 in (3): then 

m_ ” n aj 

A < 0 >" 

and substitution for G in (3) yields 

m m, t wa ... 

m . ' 1 

This is the matrix equivalent of a well-knownidentity due to Heaviside.* 
Case II. Boots of A(A) = 0 Repeated. When repeated roots occur, a 
set of a roots equal to A, will give rise in (1) to a set of s terms of the form 
B, aB, B. 

A—A, + (A—A,)* + - + (A —W (5) 

Writing A(A) s (A — A,)® A 4 (A), we obtain in the usual way 

b i ra-* Q(A)jw i 

A S (A) J A _ A ; 

The preceding formulae are all identities in A, and following the usual 
operational methods for scalar differential equations, we may replace 

d 

A by the differential operator Z>=—, and so derive corresponding 

“operational” identities. Such operational identities are particularly 
convenient in the solution of systems of differential equations with 
given initial conditions (see, for instance, §6-9). 

* For remarks on Heaviside’s method of solution see { 6*9. 
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Examples 


(i) Operational Formulae for the Case of Distinct Roots . In (2) 
substitute A = 2), and postmultiply both sides by f(D) x(t) — £($), 
where x(t) and £(t) are columns of any sets of m quantities dependent 
on t. Then, using the property F(A r )f(A r ) = 0, we can write the resulting 
identity as 

G(D)x(t) - Q(D) {/(. D) x(t) - m} + £ A [ - ^~{^ )] x(t) 


+ 


0(D) F(D) 
A (D) 


n 


m- s 

r=»l 


D — \. 


m. 


( 6 ) 


Since f(D) —f(\.) is divisible by D —A r , the first summation in (6) is 
rational and integral in the operator D. 

If x{t) is now assumed to satisfy the differential equations 


then (6) reduces to 


j(D)x(t)-m = o. 


.(7) 




r-1 


.( 8 ). 


It should be noted that (8) holds good for arbitrary columns x(t\ and 
£(£) only when Q = 0 in (2). If Q does not vanish, then (8) is only true 
in conjunction with (7). The formula will be used in § 6-5 to obtain a 
special form of solution of a system of differential equations. 


(ii) Operational Formulae for Case of Repeated Roots. The modifica¬ 
tions to (6) and (8) when repeated roots occur will next be considered 
briefly. The terms contributed to the right-hand sides of these equa¬ 
tions due to a set of s roots equal to X 8 are clearly 




■(9) 


It is possible to substitute for the first summation in (9) another 
summation, each term of which is rational and integral in D. Thus it 
can be shown that 

£ TJTTvW .( 10 ) 

i-l\D — A,)’ i-i 

in which 

(D-WW, A.) = f(D) -/(A.) - (D- A,)/(A.) -... ~~j 

.( 11 ) 
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By expansion of f(D) =/(A, +Z)- A g ) in a Taylor’s series it is at once 
seen that #<(Z), A,) is rational and integral in D. To establish (10) it is 
only necessary to collect together the separate powers of 
and to show that the total coefficients of these terms vanish: use 
should be made of the relation l’(A)/(A)/A J (A) = (A—A,)*/. The actual 
proof can be left to the reader. 

The terms contributed to the right-hand sides of (0) and (8) may 
thus be taken as 

sw wo- i TirWw.( 12 > 

<-l <- i(V-A s ) x 


Part II. Construction of the Complementary 
Function and of a Particular Integral 

5-10. The Complementary Function. The notation and ter¬ 
minology summarised in the present section should be carefully noted 
as it will hereafter be adopted as the standard. 

The number of the dependent variables is m and the n roots of 
A(A) = 0 are A lf A 2 ,..., A n . These roots are not necessarily all distinct, 
but different suffixes are used to specify the full set of roots. Generally 
Ay denotes a typical simple root, and A g represents a typical member of 
a set of 8 equal roots. 

The complete complementary function of the differential equations 
f(D) x — £ * 0 is constructed as the sum of arbitrary multiples of the n 
independent constituents corresponding to the n roots A x , A a ,A n . 
Subject to the conventions just explained regarding the notation for 
roots, each root will contribute one constituent. 

(a) Simple Roots. The constituent appropriate to the typical simple 
root A,, is denoted by 

* = {fc lr , & 2 r> •■.,k mr }e x ' i = k r e x ' i , 

and the column of constants k r can be chosen proportional to any non¬ 
vanishing column of the adjoint F( Ay) of/(Ay). This matrix will be a 
product of the type k r K r 

(b) Sets of Equal Roots . If A g is one of a set of s equal roots, the con¬ 
stituent appropriate to A g is written 

x = {k u (t), k u {t) . krjt)}^ = k,(t)eW, 
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and the a columns of the type k 8 (t) appropriate to the whole set of 
equal roots may be chosen proportional to any a linearly independent 
columns of the set of matrices (see § 5-7) 

U 0 (t,A a ) - JW, 

( 1 ) 

ffifr A.) = F(A a ) + tF(A s ), 

( 2 ) ( 1 ) 

U 2 (t,A 8 ) = F(\ 8 ) + 2tF(\ 8 ) + m\ a ), 


( 8 - 1 ) ( 8 - 2 ) <*“ 3 ) 

CUM,) = F(A 8 ) + (a-l)tF(A 8 ) + { —^-flfiF(A 9 ) + ...+^F(A a ). 

The elements k is (t) are thus polynomials in t of degree a — 1 at most. In 
the particular case where 8 = 1 (and the root concerned is thus simple) 
they are constants, and the brackets indicating their dependence on t 
may be omitted. 

The columns k 8 (t) define, as it were, the modes of the constituents in 
their relation to purely exponential laws. The column appropriate to 
the typical root A 8 (whether simple or multiple) will be spoken of as the 
modal column corresponding to the root A 8 , and any element of a modal 
column will be referred to as a modal coefficient. 

Since k 8 (t)e x * by definition satisfies the differential equations 
f(D) x = 0, it follows that the modal columns have the property* 


f(A 8 +D)k 8 (t) - 0. 


The (m, n) matrix formed from the modal columns is of great importance 
in the further theory. It will be spoken of as the modal matrix and it 
will be denoted by 




MO 

^21 (0 


k 12 (t) 

^22(0 


*m(0 

^2 nW 


.(1) 


L^ml(0 ^m2(0 ••• k mn (t) J 

In view of the remarks in the footnote to § 5*7 it is seen that in the set 
of a modal columns appropriate to any set of a roots equal to A 8 it is 
always legitimate 10 replace t by (t — r ), where r is arbitrary. Hence 
in particular a modal matrix k(t) may always be replaced by k(t—r). 


* Equation (5-6*1) is thus the particular case of this equation for « = 1. 
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If M(t) represents the (n, n) diagonal matrix 

0 ... 0"|, .(2) 

0 e A ** ... 0 


L 0 0 ... e A ^J 

and c is a column of n arbitrary constants c 1? c 2 > the complete 
complementary function is expressible as 

x — k{t)M(t)c. 

With regard to the matrices U^t, A g ), it will be noted that F(X 8 ) 
appears as the coefficient of the highest power of t in every matrix of 
the set. Hence if U 0 (t, A g ) contains any one column—say the first— 
which is not null, then the corresponding (first) columns in each of the 
succeeding matrices U v TJ 2 , ..., t/ g -i must contain t, t 2 , ..., J 8-1 for highest 
powers. It is evident therefore that these 8 columns must be linearly 
distinct, and they will accordingly yield the required 8 modal columns 
appropriate to the whole set of 8 equal roots. 

If, on the other hand, F(X a ) is null (i.e. if/(A g ) is multiply degenerate), 
then the last term will be absent from U^t, A g ), and the coefficient of 

(i) 

t*- 1 (now the highest power of t) will be F( AJ. By the same argument 

(i) 

as before, any one non-vanishing column of F( A g ) will then give rise 
to s—1 modal columns; and the additional modal column required 
must then be sought from the first other non-zero column of the family 
U 2 , U Zi etc. which is linearly distinct from the s—l columns already used. 
The extension to more complicated cases will be obvious. 

Examples 

(i) Second-Order System. Suppose 

f(D) x s TD 2 — 5D + 3, 4Z) 2 -5 D 1 Y Xl ~\ = 0. 

[ 2D 2 -2), -2) 2 -Z) +3jUJ 

Here A(A) = — 9(A+1) (A — l) 3 , and 

F(\) = r-A 2 ~A+3, — 4A a +5A"|. 

L — 2A 2 + A, A a — 5A + 3J 

The roots of A(A) = 0 are denoted by A x = — 1 and A 2 = A 3 = A 4 = 1. 
The first column of jP(A x ) is {3, — 3}, so that we may choose k t *= {1, — 1}. 
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Again, jF(A 2 ) contains the non-zero (first) column {1, — 1}, and the 

( 1 ) ( 2 ) 

corresponding (first) columns of F(\ 2 ) and F( A 2 ) are {-3,-3} and 
{ — 2, — 4}, respectively. Hence we may choose • 

^(0 = ““ 1 }> 

^ 3 (0 — {— 3, -3}-M{l, — 1}, 

ut) = { — 2, — 4} 4- 22{ — 3, — 3} 4- 1*{ 1, -1}. 

The complementary function is accordingly 

uJ = 1, 1,-3 + *,-2-6* + * 2 ] re-* 0 0 onrcii. 

[x 2 \ L— 1, -1, -3 —Z, -4-6^-^J 0 e* 0 0 c 2 

0 0 e* 0 c 3 

0 0 0 e l _ c 4 _ 

If this result be denoted by x(t) = k(t)M(t)c, the reader can readily 
verify that other possible forms of the complementary function are 
x(t) = Jc(t — T)M(t)c' and x(t) = k{t-T)M{t — T)c", where c' and c" are 
new sets of constants, and r is arbitrary. 

(ii) Form of Solution when A(A) Vanishes Identically . If A(A) reduces 
to a constant, other than zero, so that the degree in A of A(A) is zero, 
no arbitrary constants can enter into the solution of f(D)x = 0. The 
general solution then is x = 0. On the other hand, if A(A) vanishes 
identically, the degree in A of A(A) is indeterminate, so that the presence 
of arbitrary constants in the solution is not precluded. This case, which 
arises when the m equations of the system f(D) x = 0 are not distinct, 
is abnormal. It corresponds to the case in which one or more ciphers 
are present in the principal diagonal in Smith’s canonical form for 
A-matrices (see §3*15). 

A simple example is the system 

h(D) x = Z), 0, O' — 0, 

0, D(Z>— 1), 0 x a 

_ 0, 0, 0J [z 3 - 

for which A(A) s 0. The general solution is clearly 
x x = c 19 
x 2 = c 2 + c 8 e*, 
x 3 == arbitrary, say \jr(t). 
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Consider now an equivalent system f(D) y = 0, say 
f(D) y s Ah(D) By « 0, 

in which | A | and | B | are assumed to be non-zero constants. For this 
new system A(A) s 0 as before, and (as can be directly verified) all the 
first minors of | /(A) | contain A 8 (A -1) as a common factor. The general 
solution of/(D) y = 0 will accordingly contain one arbitrary function, 
arising from the fact that | /(A) | =0. In addition there will be three 
constituents, not included under the arbitrary function, and involving 
three arbitrary constants in all, originating from the common factor 
of the first minors of | /(A) |. 

More generally, if the A-matrix /(A) is of rank r, the general solution 
contains n—r arbitrary functions. Further, if the common factor of 
the minors of order r is of degree p in A, then there are p additional 
constituents of normal type corresponding to this common factor. As 
an illustration, consider the system for which 

/(A)ssr A, A, A 2 +l'. 

A 2 -1, A 2 +1, A 

A 3 . A 3 + 2A,2A 2 +1_ 

Here/(A) is of rank 2, and the first minors in | /(A) | have no common 
factor. The general solution contains one arbitrary function and no 
constituents of normal type, and may in fact be written 

y 1= =(Z)4 + Z) 3 + 

y t = -(D*-D*-im)> 

Vs = — 2lhjr(t). 

The differential operators are here respectively the cofactors of the 
elements in the first row of | f(D) |. 

As a second illustration, take 

/(A) = f-1, A 2 (A 2 —1), A 3 (A 2 -1) 1. 

-1, A(A 2 -1) (A+1), A 2 (A 2 -1) (A+1) 

. A, A 8 (A 2 -1), A 4 (A 2 —1) 

This matrix is also of rank 2, but the first minors contain A(A 2 — 1) as 
a common factor. The general solution is 

Vi = o, 

y t = c 1 + 2c i e-*-Df(t), 

y, = c 1 +c a e‘+c 8 e- 4 +^(0. 
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5*11. Construction of a Particular Integral. Suppose the 
differential equations to be /(D) x —£(<) = 0, and denote a particular 
integral as P(t). Then evidently we can choose 

= (i) 

The interpretation of this symbolic solution will now be considered. 

(a) Functions £,(t) Exponential . A simple case frequently arising is 
where £(£) can be represented over the range of t under consideration 
by a sum of the type ^ = 

The indices 6 , and the columns of m constants p , may be real or complex. 
The part of the particular integral arising from the typical term of 
the series then is 

Pit) - -iP) e ei 0 _ petr o (2) 

m ~A(D) ep ~ e m p - (2) 


This solution fails in the exceptional case where 6 is a root of A(A) = 0 
(e.g. resonant forced oscillations of an undamped dynamical system). 
Suppose 6 = A,., where for simplicity A r is assumed to be an unrepeated 
root of A(A) = 0. The equations of which a particular integral is 
required are now /(£>) x = t»p. .(3) 

Assume P(t) = W^t, X r )b, where W x is the matrix defined in §5-7, and 
b is a column of constants left free for choice. On substitution of this 
value for P(t) in (3) and application of (5-7-1), we obtain the condition 


” ~ (i) 

A(A r ) -f ^A(A r ) 

( D 

Now A(Ay) = 0, but since A,, is here assumed not repeated A(A r ) + 0. 
(i) 

Hence 6 = p\ A(A r ). The particular integral sought is accordingly 

pa) =^[ 4 +#,)]/). 

MK) 

e m F(6) 

Since by (2), satisfies the differential equations 

f(D)x-e et p = 0, 

it follows by differentiation with respect to 6 that the particular integral 
of the equations J(D)x- t?e e, p = 0 

u m _ \" m 

' ~ 89»L 4(0) J P _ VM J 4(0 ) P ' 
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(6) Functions £(i) General. When the functions £(<) are general, the 
usual procedure is to make use of certain expansions in partial fractions. 

For example, if the roots A, are all distinct, we can use the simple 
identity n , 


= 2 


and express (1) as 




P(t) 


nm 2 nr 

r-i - - 


m- 


A (K)(D-K) 


The particular integral given by this method is thus 

n pXfi ri 

P(t) - F(D) S -J- e~V£(t)dt .(4) 

The lower limit of integration gives rise to terms which can be included 
in the complementary function, and it is often omitted for convenience. 
When repeated roots of A(A) = 0 occur, the part of the expression 

for 1/A (D) in partial fractions arising from a root A, of multiplicity 8 
8 

is of the form £ /r . 8 , where the quantities a are constants which 

a-u-^-A,) 8 

are readily found by the usual methods. The corresponding terms of 
the particular integral are then 


£ a a F(D)eVQ*e-V£(t), 

8-1 

where Q* denotes q repeated integrations with respect to t. 

Another possible treatment is to express the complete operator 
F(Z>)/A(Z>) in partial fractions. In the special case where the degree 
in D of F(D) does not exceed that of A(Z>), the operational form of the 
expansion corresponding to (5-9-4) can conveniently be used for this 
purpose. This yields 




D 


A(0) 


But 


so that 


D-\ 


~K%K) D - K 

|(<) = eV(i + A,<2)e-V£(<) 


m- 


= !(i)+VvjVvg(i)d*.(5) 


m 


A(0) 6W + . , <« 

1 A(A,) 


£(<) + 2 -^-F(X r )\ t er^mdt. 

f"l A/1 \ Jt t 


l A(A,) 


.( 6 ) 
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If (5*9*4) is applied to evaluate F(oo)/A(co), equation (6) can be written 

P (t) = ^M&)+ S.(7) 

i J u 


A(oo) 


r-1 


MK) 


As a particular application of (7) suppose £(l) = eP‘p. Then 


P(t) = e« 


F( oo) ” —e^-V) 


A(oo) /, + r ?i 


U) 

(tf-AjACA,) 


.( 8 ) 


The terms involving eM may be included under the complementary 
function and therefore omitted from (8). The particular integral can 
then, with a little reduction, be identified with (2). However, with 
regard to (8) and the more general formula (7), it may be noted that if 
(as normally) the degree in D of F(D) is less than that of A(D), then 
jF(oo)/A(oo) = 0 . In this case the particular integral as given has the 
convenient property that P(t 0 ) = 0. 
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LINEAR ORDINARY DIFFERENTIAL EQUATIONS 
WITH CONSTANT COEFFICIENTS (cmtmued) 

Part I. Boundary Problems 

6*1. Preliminary Remarks. In Chapter v it has been shown 
that the general solution of a system of linear differential equations 
f(D)z — £ = Oinm dependent variables contains n arbitrary constants, 
where n is the degree in A of the determinant A(A) = | /(A) |. We shall 
now consider the question as to how the values of these constants are 
to be determined in order that the solution may satisfy any assigned 
supplementary conditions.* 

A “supplementary” or “boundary” condition usually consists of a 
linear relation connecting the values of the dependent variables x , and 
possibly their derivatives up to a certain order, at one or more given 
points < 0 , t v etc. of the range of variation of t. The problem is said to 
be a one-point boundary problem , a two-point boundary problem , and 
so on, according to the number of different points t 0 , t v etc. concerned. 
Attention will here be restricted to one-point and two-point boundary 
problems. 

Examples 

(i) Simple One-Point and Two-Point Boundary Problems. If the 
differential equations are 

ri7Z> 2 - 17D + 6, 7Z) 2 -9D + 4l Tx{\ = 0, .(1) 

L 7D 2 + Z)+10, 3Z> 2 —D + 4 J U* 

the general solution, which involves four arbitrary constants, is 

-3} + c 2 e-*{l, — 2} + c 3 e 2 *{7, -20} + c 4 e*{8, -21}. 

If the values of x v x 2 and of their first derivatives x v x 2 at say t = 0 are 
assigned, we have a simple one-point boundary problem; whereas if 
the values of x v x 2 at t = 0 and t = 1 are given, we have a two-point 
boundary problem. In either case the four constants of integration 
Cj, c 2 , c 3 , c 4 are uniquely determinable. 

* The differential equations, considered in conjunction with the supplementary con¬ 
ditions, are sometimes referred to as a 44 linear differential system 44 . The differential equa¬ 
tions themselves are then spoken of as the 44 system of linear differential equations 44 . 
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(ii) General One-Point Boundary Problems. The supplementary 
conditions are not always of the simple form given in example (i). As 
an illustration of a rather more general class of one-point boundary 
problem we may suppose the differential equations to be (1) and the 
supplementary conditions to be 

3 *i(*o)- 2 i M*o) *^2(^0) = 

2a?i(^ 0 ) + 2^ 2 (^o) + ^ 2 (^ 0 ) == 

•^1(^0) “ 2 a: i(^o) ” ^^2(^0) “* 2 a ' 2 (^o) = ~ 1 > 

#1(^0) + ^2(^0) ^2(^0) = w 

It is of course possible here to solve these algebraic equations and 
to derive explicitly the initial values of x v x 2 , x v x 2 . The problem is then 
reduced to the simple type already considered in example (i). However, 
this process, which amounts to a conversion of a given set of boundary 
conditions to an equivalent simpler set, may not always be convenient. 
It is sometimes preferable to deal directly with the boundary con¬ 
ditions in their original form. To express (2) concisely, we introduce 
the symbol D 0 to mean the differential operator 



The conditions can then be written as 

-3Z> 0 -2, -2 % W/ 0 )l = 1“ 7-|.(3) 

2, 2Z> 0 4-1 x 2 (t 0 ) 5 

Z) 0 -2, -3Z) 0 -2 -1 

. 1 , 2Z> 0 +1 J 0 . 

With a more general one-point boundary problem the elements of the 
(4,2) matrix on the left of (3) would be replaced by given polynomials 
in D 0 , and a given column would be assigned on the right. Methods of 
solution with the boundary conditions in the unreduced form are 
given in § 6-4. 

6-2. Characteristic Numbers. In §6*1 it was assumed that the 
number of the supplementary conditions exactly equals the number n 
of the free constants in the general solution. It was also tacitly assumed 
that in such cases the values of those constants would be uniquely 
determinable. However, if the differential equations themselves con¬ 
tain one or more variable parameters, free for choice, it may be possible, 
for particular values of these parameters, to obtain solutions in which 
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all the free constants are not uniquely fixed, even though just n supple¬ 
mentary conditions are assigned. Alternatively, it may be possible— 
for special values of the parameters—to satisfy more than » supple¬ 
mentary conditions. The study of these particular values, which are 
known as characteristic numbers, is of fundamental importance in many 
physical applications, such as the determination of the natural fre¬ 
quencies of vibration of dynamical systems. 

Example 

Consider the transverse oscillations of a string stretched between the 
points y = 0 and y — l. Let /i denote the mass per unit length of the 
stretched string and T the tension: then the transverse displacement 
z at any time t satisfies the differential equation 

dh_ a d*z 
dy a ~ a dt*’ 

where a 2 = /ijT. Now assume a free vibration of the string to be 
z = Z sin a)t, where o> is a parameter left free for choice and Z is a func¬ 
tion of y only which satisfies the equation 

+<">**- 0 .( 1 ) 

and the two supplementary conditions Z = 0 at y = 0 and y — l. 
The general solution of (1), which contains two arbitrary constants, is 

Z = c 1 e ia **'+c 2 e- iamv , 

and the supplementary conditions require that 

c 1 +c a = 0,1 

c 1 e <a " , +c 2 e - *" rf = 0.J .' 

In general, the only solution of equations (2) is c 1 = c 2 — 0. However, 
in the present case a> is free for choice, and whenever it is a root of 
sin (cud) = 0, equations (2) are satisfied provided c x = — c 2 . The char¬ 
acteristic numbers (o are accordingly n/al, 2n/al, etc., and the solution 
corresponding to the rth of these numbers is Z = csin (my/l), where c 
is a free constant. 

6*3. Notation for One-Point Boundary Problems. Attention 
will here be restricted to the type of boundary problem in which no free 
parameters occur in the differential equations and the number of the 
assigned conditions exactly equals the number of the free constants 
in the general solution. 
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The simplest, and normal, type of problem is where the differential 
equations are of order N and the determinantal equation A(A) = 0 
has Nm roots, so that n — Nm. In a one-point boundary problem it is 
then usual to have assigned the initial values of the m variables x and 
of their derivatives up to, and including, the order N—l. The problem 
will be said to be of standard type when the differential equations and 
the boundary conditions have the foregoing special features. 

In the general one-point boundary problem the assigned conditions 
consist of n independent linear combinations of the values at t = t 0 of 
the dependent variables x, and (or) of the derivatives of these variables 
up to a given order 8. Such a general set of n conditions may be repre¬ 
sented by (see example (ii) of § 6-1) 


0(A)) *Wo) ~~ 

01l(A>)> 012(A))* •••> 01 ro(A)) 
02l(A))> 022(A))* •••> 02m(A) 


X\ (^o) 

^2(^0) 


-1 

0 0 


_0»l(A>)> 0nz(A>)> •••» 0nm(A>)_ 


0). 


kl 

.( 


The (n, m) matrix <f>(D 0 ) has for elements polynomials* of Z> 0 of degree 
8 at most, while 3> x , <S> 2) etc. are given constants. 

The set of conditions (1) may be expressed alternatively as 

(BqDq + At A! -1 + • ■ • + A—l A> + B a ) x(t Q ) = O, 
in which B 0 , B v etc. are (n, m) matrices of given constants. Such a set 
of boundary conditions may conveniently be referred to as being of 
order 8. If 8 > N — 1, it is clearly always possible to reduce the order of 
the supplementary conditions to N — 1 by successive substitutions 
from the differential equations. 

When the problem is of standard type <f>(D 0 ) is an (iVm, m) matrix, 
which can be partitioned into square matrices of order m and con¬ 
veniently represented by 

t(D 0 ) = {I m ,I m D 0 . I m D»-'} .(2) 

In § 6-4 a direct solution of the differential equations will be 
obtained which is applicable to the general one-point boundary 
problem. When the problem is of standard type a special form of 
the solution can be used which is often more convenient than the 
direct solution (see §§ 6*5 and 6-6). 

* In connection with boundary problems of this general type it is important to observe 
that the ohoioe of the polynomials <£<y(A>) and of the constants is not altogether unre¬ 
stricted. The boundary conditions must be compatible with the differential equations. 
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Examples 

(a) Problems of Standard Type 

(i) Single Differential Equation . If the given differential equation is 

D»x+p 1 D n ~ 1 x +... +p n x = £(<), 
then m = 1 and N = n. Hence (1) and (2) give 

0(A>) *(*o) = {^> A)> *• •> ^ *(*(>) = ®2» • • •> 

(ii) Linear Dynamical Equations of Lagrangian Type (see example 
(ii) of § 5-5). If there are m generalised coordinates q , then N = 2 and 
n * 2m. Hence, assuming given initial values <S> for the sets of 
quantities q and q, we have 

0(A>)9A)) = 4®o}?(W = 


(6) Problems of General Type 

(iii) A System of Fourth Order. Consider the homogeneous equations 

f(D) x = p + 2, Z) 3 + 2D 2 - 11 pcJ =0. .(3) 

L D\ D *-1 JU 2 J 

Here m = 2 and iV = 4: but A(A) = A 2 —A — 2, so that n = 2(4=miV). 
Only two boundary conditions can be assigned in this case, and the 
problem is not of standard type. The simplest set of supplementary 
conditions here would be and x 2 (t 0 ) = 0 2 , corresponding 

to <f>(D 0 ) = I 2 . For a standard problem with m — 2 and N = 4, we 
should require n = 2 x 4 = 8, and the values of a; and of its first seven 
derivatives at t = t 0 would be assigned. 

(iv) Change of Order of Boundary Conditions . The formulae of § 5-9 
can be applied to change the order of a set of one-point boundary con¬ 
ditions <f>(D 0 )z(t 0 ) = O. In (5*9*8) write <f> for G, t 0 for t , D 0 for D: then 


* - AID\r(f ^ v v . <f>(D 0 )F(D 0 ) 

o - <4(A>)*«o) = r 2——*(*»)+—ATzy-^ 0 ) 


n 

- S 


a 


r?xD 0 -K 


£(«o), 


.(4) 


a> 

where C' r = ^(A r ) F(A r )jb.{\). Since/(£>„)-/(A,) is exactly divisible by 
Dq — \ and the quotient is of degree N — 1 in D 0 , the given boundary 
conditions are expressed by (4) as an equivalent set of order N — 1 . 
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For example, let the differential equations be (3). Here A x = 2 and 
A a = — 1, and it is readily verified that (4) yields the relation 

<t>(Do)x(t 0 ) - W(2){15,-4}[-Z> 0 -l, 

-i^(-l){0,-l}[-Z> 0 -h2, -D* 0 + 2Dl]x(t 0 ) .(6) 

With equations (3) only two supplementary conditions can be assigned. 
If the order of these exceeds 3, they are reducible by (5) to an equivalent 
pair of order 3. Suppose, however, that the given conditions are simply 
Xi(t 0 ) = and x 2 (t Q ) = <I> 2 , corresponding to 0(D O ) = ^ 2 * Equation (5) 
would then lead to an increase in order of the boundary conditions. 
Thus, with 0 = I in (5), we obtain at t - t Q 

{#£,# 2 } == J{15, — 4] (- Dq%i -~x 1 — L)\x 2 — D*x 2 ) 

- ${°> -1} (- 2^ + 2* x - jDj s a + 2Dga: 2 ), 
or SDqX! + 6^ -f 5Dga; 2 + 5 D\x 2 = 0,| 

D 0 x 1 + 2x 1 + Dlx 2 + 2D%x 2 -x 2 = O.J .^ 

It is an instructive exercise to show that these equations are a necessary 
consequence of the differential equations (3). As already explained in 
§ 5*9, equation (5-9-8) is not an identity unless the quotient Q is absent 
in (5*9*2). In the present example such a quotient would arise because 
the degree in D of F(D) exceeds that of A(D): and it would actually 
give rise on the right-hand side of (5) to an additional term 

p5 + A> + 3, - A) - 3] p 0 + 2, + 2D\ - 11 x(t 0 ) .(7) 

L - 1 , 0 JL Dl Dg-^1 J 

The relation (5), as thus modified, vould have been an identity for 
all values of x. On the other hand, the matrix product (7) contains 
f(D 0 )x(t 0 ) as its last two factors, and this is null by (3): for this reason 
the product is omitted from (5). The conditions that the fully expanded 
product (7) shall be null are precisely the equations (6). 

6*4. Direct Solution of the General One-Point Boundary 
Problem. Before obtaining the direct solution in its most general 
form, we shall consider a simple case. Suppose the differential equations 

tob.giv.nby A1+B4 + C,- 0, 

representing the small free motions of a dynamical system with m 
generalised coordinates q. Let the n( = 2m) roots Ay of 

A(A) = |^A* + £A + (7| =0 
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be all distinct, and let the values q(t 0 ), q(t 0 ) be assigned. Now the 
most general solution of the foregoing equations is expressible as 

q(t) = kM(t)Cy .( 1 ) 

where k and M areas defined in §5* 10, andcis a column of n arbitrary 
constants. Moreover, since in the present case the roots A, are assumed 
to be all distinct, the (m, n) modal matrix k contains only constant 
elements. 

On differentiation with respect to t, equation (1) gives 

q(t) = kAM(t)c, 

where A is the diagonal matrix of the roots A r . Hence when t = t 0 we 
have q(t 0 ) — kM{t 0 )c and q(t 0 ) = kAM(t 0 )c. These two relations can be 
combined into the single matrix equation 

{(7(^o)j 2(*o)} = mto) c > .(2) 

where l is the square matrix of order n defined by 


1 = 

■ Jc- 

= 


k ln 


JcA. 


^ml 

k 

. . . 




Ai&n 

... \ik ln 




Ai&ml 

• • • A n k mn _ 


Solution of (2) for c yields 

c = M( — t 0 ) l~ x {q(tQ), q{tjj}> 

and substitution of this value for c in (1) gives the direct solution 
q(t) = kM (t — * 0 ) 0 ), 3 (^ 0 )}- 

The corresponding solution for a non-homogeneous system of general 
order, and for a general one-point boundary problem, can be obtained 
by a similar method. Assume the equations to be 


f(D)z(t)-m = 0, (3) 

and let the boundary conditions be 

<f>(D 0 )x(t 0 ) = 4>. (4) 

The general solution may be written 

z(t) = k{t)M(t)c + P(t). (5) 
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The first term on the right is the complementary function obtained in 
§5-10, while P(t) denotes any particular integral as found by the 
methods of §5-11. On premultiplication of both sides of (5) by <f>(D) 
it is readily proved that 


<j>(D)x{t) = l(t)M(t)c + <f>(D)P(t), 
where l(t) is the square matrix of order n 




^n(0’ ^ia(0> •••> huff) 

W(0> •••> W(0 


(«) 

(7) 


LUO. 0v*(0> • • •. U(0 J 

the sth column of which is given by 

{WO} = 9HA 4 +z>){U0}- 

In (6), which is true for all values of t, substitute t = t 0 and premultiply 
throughout by M{ — t a ) Then using (4), we obtain 


c = M (-«„) l~\t 0 )<b-M(-t 0 ) H(<o) P(t 0 ) .(8) 

Substitution of this value for c in (5) yields the direct matrix solution* 


x(t) = k(t) M{t - g l~*(t 0 ) {<D - <f>(D 0 ) P(t 0 )} + P(t) .(9) 

A more symmetrical equivalent is 

x(t)-P(t) = mM(t- gH(g <f>(D 0 ){x(t 0 )-P(t 0 )}. ...(10) 

When the boundary problem is of standard type,f so that <j>{D 0 ) has 
the special form (6-3-2), the matrix l is expressible in the partitioned 
form 


1(0 = 


wo, 

(Aj.+ D) ^i(0> 


k 2 (t), 

(A 2 +D)W0. 


K(t) 

(\ n +D)k n (t) 


KAi+^-^^O, (Aj+iO^WO, ---> (An+iJ^WOj 

.( 11 ) 

where each element is a column matrix. If further the roots of A(A) = 0 
are all distinct, the elements W<) are all constants, and the brackets 

* Other forms of this solution can be obtained by replacing k(t) by k(t -r), where r is 
arbitrary, and defining the matrix l(t) to correspond (see remarks following equation 
(5-10-1)). 

f For an alternative method of solution for problems of standard type see § 6*5. 
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indicating the dependence of these coefficients on t can be omitted. 
The formula (11) then reduces to 


1= ~ k v k 2 , k n "I. .(12) 

A 2 k 2 , .... ^nk n 

\”-'k ly \”-'k t ,...,AZ-'k n _ 

Examples 

(i) A System of Fourth Order. Consider the homogeneous equations 
(6 ‘ 3 ' 3) f(D) x m TD + 2, D 3 + 2D 2 -11 IxJ = 0. 

L D\ D*-l JU- 

Here, with A x = 2 and A a = — 1, 

F(*i) = \ 15 -15i = r 151 [1,-1]; 


giving 


JXAJ-J 15 -15J = |- 
W-j- 0 °j = |‘ 0j [1, — 1], 

4 -[‘:4 


If the initial conditions are and x 2 = 0 2 at t 0 = 0, corresponding 

to <f>(D 0 ) — / 2 , then by (1)1 = k and 


' 15 0- 

.-4 -1. 


4 151 


Hence the solution is 


* = -&ri5 oi re“ oir-i oirnu. 

L-4 — lj Lo e-'JL 4 15j|a>J 

(ii) Linear Dynamical Equations of Lagrangian Type. If the values 
at t = $ 0 of the m generalised coordinates q and of the m generalised 
velocities q are assigned, then 


In this case 


i>(D 0 ) = {I m ,I m D 0 }. 


L(Ai + D) ki(t) y (A a + D) k^(t)y •.., (A^+ D) k n 
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(iii) A Modified Form of the Direct Solution . If the value of c 
given by (8) is introduced in (6) instead of in (5), the result is 

<f>(D) x(t) = 1(1) M(t - t 0 ) l-\t 0 ) {<& - <f>(D 0 ) P(t 0 )} + <f>(D) P(t), 

which, on premultiplication by Z _1 (£) and rearrangement, gives 

t-Ht) m {x(t) - P(t)} = M(t -1 0 ) l~'(t 0 ) <p(D 0 ) {x(t 0 ) - P(t 0 )}. 

.(13) 

Now choose as new dependent variables the set of n functions oc(t) 
® Ven by cc(t) = l-'(t)<j>(D)x(t), 

and put J3(t) = Z _1 (0 <j)(D) P(t). 

Then the solution (13) is expressible in the simple form 
<x(t)—f}(t) = M(t — t 0 ){(x(t 0 )—fi(t 0 )}. 

In the case of the Lagrangian equations considered in example (ii) 
x = q and <f>(D) = {I m , I m D }: hence 

cc(t) = l-Ht) q{t)}, 

In dynamical applications considered in Chapter xi the quantities a($) 
are referred to as the reducing variables , since by the choice of these 
variables the equations of motion are reduced to a simple diagonal form. 


6-5. Special Solution for Standard One-Point Boundary 
Problems. In numerical applications of the direct matrix solution 
the principal difficulty lies in the calculation of the inverse matrix 
l-\t Q ). When the differential equations and the boundary conditions 
are of standard type, the direct inversion of this matrix can be avoided 
by an alternative method of solution, which will be referred to as the 
special form of solution. The intial conditions in this case'are expressed 

by <f>(D 0 )x(t 0 ) = {l m ,I m D 0 , ...,I m D»~'}x(t 0 ) = <D, .(1) 

and if the roots of A(A) = 0 are all distinct (as will here be assumed) 
the special solution is 

x(t)-P(t)= E —fj— P(K) 
r=1 A(/g 

Since (/(Z)„) — /(AJJAA) — A,) is a polynomial of degree N —1 in D 0 , the 
initial values required for an application of (2) will all be known. 


D 0 -K 


{Ak) ~ P(to)}- 
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To prove the formula, assume a column of m quantities y(t) to be 
defined by the relation 


m 


y{t)-P(t) - S 

-A(X) ^ . (3) 

which P(t) = £(<). The right-hand side of (3) is identical with 


that of (2). Now, since every column of the matrix e^F( A,) is anni¬ 
hilated by the operator /(£>), it follows that 


f(D)y(t) = m- 

Hence the functions y(t) satisfy the given differentia] equations. Next, 
premultiply both sides of (3) by the operator <j>(D) and after differen¬ 
tiation put t = t 0 . The result can be expressed as 


r-1 ~ / V 


<fi(D 0 )F(D 0 ) 

A(D 0 ) 


n 


£(<„)- S 

r=l 


d 0 -a/ (< o) ’ 


(*) 


( 1 ) 

where (7 r == ^A,) P(A r )/A(A r ). But by (6-3-4) the right-hand side of (4) 
is equal to 0(Z> O ) or f° Hence the functions y(t) satisfy the 
required boundary conditions. This establishes (2). 

With a problem of standard type the degree of P(A) in A is less* than 
that of A(A). Hence a convenient form for the particular integral such 
that P(g = 0 is (see (5-11-7)) 


p(t) = s ir-nK) 

r ” 1 A(A r ) Jt - 


( 6 ) 


If, further, we write = k r K r and 


/(D) = A 0 D N +A 1 D N ~ 1 + ...+A N _ 1 D+Aif, 

the special solution may be expressed as 

*(0-P(«)= 2 ^[A 0 *?-'+A 1 A?-* + ...+A N _ v ...,A 0 A r +A 1 ,A 0 ] 

r “ 1 A(A r ) 

x {*(<o),D 0 a:(«o), ....D^-^^JeVMP.(6) 

♦ Since f(\)F{\) =A(A)/ and /(A) is of degree N t while A(A) is of degree n = mN> it follows 
that F( A) has degree N(m - 1). 
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The special solution, when applied to homogeneous equations, is 
essentially the same as the “method of isolation” given by Routh.* 
In § 6*9 it will be shown that (2) is also the matrix equivalent of the 
well-known solution obtained by Heaviside and others by operational 
methods. 

Examples 


(i) Single Homogeneous Equation of Order n. Let the given differential 
equation be f{J))x = (A()D n + AiD n-i + ... +An)x = 0 . 

In this case A(A) =/(A) and the “adjoint” F(A) = 1. Hence (6) gives 


x(t) = E -( I y—[^ 0 A ” _1 + ^ 1 A ^ _2 + ... + ^ n _ 1 . A 0 \ r +A 1 ,A 0 ] 

x {*(< 0 )> D 0 x(t 0 ), ...,-Do -1 *^)}* 


(ii) Linear Dynamical Equations of Lagrangian Type . If there are 
m generalised coordinates q and if f(D) = AD 2 + BD+C , the formula 
(6) gives 2m , 

q(t)-P(t) = S ^-[AK + B,A}{q(t 0 ),q(t 0 )} eV'-V. 
r=1 A(A,) 


(iii) Various Identities. When the roots of A(A) = 0 are all distinct and 
P(t 0 ) = 0 the direct solution (6*4* 10) equivalent to the special solution is 

x(t)-P(t) = kM(t -g l- 1 {z(y, D 0 x(t 0 ), ...,D»-'x(t 0 )}, .(7) 

in which l is given by (6-4*12). Comparison of (6) and (7) yields the 
identity 

kM(t-t 0 )l~'= E ffi^[^ 0 A^- 1 +^ 1 A^- 2 + ...+^_ 1 


1 (1) 
A(;g 


*N-V “•> 

A^ + A^A^e^ .(*) 

Expanding the exponentials on both sides of (8) and identifying the 
terms involving the different powers of t — t 0 , we can deduce that 


W 0 

0 ^(A a ) 


0 

0 


l- 1 


[ 0 O'... ^(AjJ 

= Z ^[^ 0 A^- 1 +... + ^_ 1 ,...,^ 0 A T + 4 1 ,^ 0 ]^(A r ), 

in which ^(A) is any polynomial of A. 


♦ See Chap, vm of Ref. 13. 
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6*6. ♦ Confluent Form of the Special Solution. We shall next 
indicate briefly the modifications to the special form of solution due to 
the presence of a set of a roots equal to \ 8 . For simplicity the discussion 
will be restricted to homogeneous systems. 

As for (5-9-5), write A(A) = (A — A J} )*A 3 (A), and denote by V i the family 
of matrices 1 {1) 

<( * < ” A * )- |s-*L3A-< A.(A) L . () 


^ \s^iLd\-< A,(A) J A _ A . . 

for i= 1 ,2, ...,a. Then it can be shown that the contribution to the 
special solution arising from the set of 8 equal roots is given by the 
column . 


y(t) = s vs-to, A a )UDo, .(2) 

i=1 

where is as defined by (5-9-11). 

To prove this result, we note firstly that, as for the family of matrices 
W(t, \ 8 ) defined in § 5-7, every column of every matrix of the family V i 
satisfies the differential equations f(D) x(t) = 0 . Hence y(t) satisfies the 
same differential equations. Again, premultiplication of (2) by <f>(D) 
yields 

so that, when t — t Q , 

m - i ^5A.)x(«. 

On application of (5-9*8) and (5-9-12) with £ = 0, Q = 0, t = t 0 , and 
D = Z) 0 , it is now easy to verify that the functions y(t) given by (2) 
correctly contribute to the satisfaction of the boundary conditions. 


Example 

Lagrangian Equations of Motion of a Linear Conservative System . 
The dynamical equations in this case are of the special type 

f(D)q=(AD* + E)q = 0, .(3) 

in which A and E are both symmetrical matrices. 

It will be shown in § 9-9 that for such a system, if A(A) = 0 has s roots 
equal to \ 8 , then the corresponding derived adjoint matrices up to and 
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<»- 2 ) 


including F( A # ) are all null. Accordingly (see (1)), V { (t — 1 0 , A s ) = 0 if 
t > 1, while 1 p- 1 e^F(X)l 

(«-d 

= ^lA,(A,) f(Ai) ' 

Moreover, putting i = 1 in (5*9*11), and using (3), we find 

UK As ) ?(*o) — ~ n° ! y —1 ” ^(^o) = + 9(*o)}- 


D 0 “"^8 


The special solution is thus 

e W-t Q ) 




|*-1A,(A,) 


+ .(4) 


where £ denotes summation for all distinct values of the roots of 
A(A) = 0. 

It is to be noted that with the present system of equations the 

matrices W % (t , X 8 ) appropriate to the roots A 8 (see § 5*7) are all null with 

( 8 - 1 ) 

the exception of W s _ x (t, A 8 ) = e A ^F(X s ), so that the corresponding 8 
constituent solutions are all obtained from W 8 __ x . The modal columns 

are thus independent of t , even when multiple roots occur. The derived 
<«-D 

adjoint matrix F(X 8 ) will here be of rank 8 , and will be expressible as 
a matrix product of the type 

F(\J = fc(A 8 ) x(A s ), 

where k(X 8 ) is an (m, s) matrix with s linearly independent columns, 
which can be taken to be the modal columns appropriate to the roots 
A 8 , and x(A 8 ) is an (s, m) matrix with s linearly independent rows. 

As a numerical illustration suppose 


/(A) = 


"32A 2 + 48, — 16A a , 4A 2 ' 

-16A 2 , 32A 2 + 48, 4A 2 

4A 2 , 4A 2 , 11A 2 + 12 


and for simplicity take t 0 = 0. Then 


and 


A(A) = 3 x 48 2 (A 2 +1) 2 (A 2 + 4) 


F(X) = 48(A a +1) 


7A 2 +12, 

4A 2 , 


— 4A 2 , 


4A 2 , -4A 2 ' . 

7A a +12, -4A 2 

-4A 2 , 16A 2 +48 
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Denoting the roots of A(A) = OasA 1 = A,= +»,A,= + 2%, A 4 = A 5 = —», 
A, = — 2 », we have A 2 (A 2 ) = — 36 x 48* and A X (A S ) = 108 x 48*i, while 
U> 

F( A 1 ) = 2x48»r 5 -4 4' 

-4 5 4 

4 4 32 

and jP(As) = -3x48r-16 -16 16]. 

-16 -16 16 
16 16 -16 


The required solution, on cancellation of some numerical factors, is thus 


?(*)=— 


5—4 4 


18x48-4 5 4 -16 32 4 


4 4 32 


32 -16 4]{«jr(0)+?(0)} 


4 11 


ie 2ii r -1 -1 l]r 32 -16 4 
108_i_i 1 -16 32 4 


1 1 -1 


32 -16 4] {2*}(0) +j(0)} 


This reduces to 


4 11 


+ the corresponding conjugates. 


q(t) = J ’ 6—4 1’ {g(0)cos< + g(0)sinl} 

-4 5 1 

4 4 8 

+Af 4 4 — 1 ] {2q(0) cos 2 < + g( 0 ) sin 21 }. 

4 4-1 

-4 -4 1 

(l) 

Note that ^(Aj) is expressible as the matrix product 



6*7.* Notation and Direct Solution for Two-Point Boun¬ 
dary Problems. 

(a) Notation. In the case of a two-point boundary problem the 
supplementary conditions consist of n independent linear relations 
connecting the values at two points t = t 0 and t = f x of the m variables 
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x and of their derivatives up to order a. Such a set of relations may be 
denoted by 

y = .(1) 


where D 0 =^ j and D x = j , and <f>, \\r are (»,m) matrices of 

polynomials in their respective operators. 

Suppose, for example, the differential equations to be of second order, 
and let there be n = 2m roots A. Then if the values of the m variables 
x at both t = t 0 and t = t x are assigned, we have (J>(D 0 ) = {I m , 0} and 
f(D x ) = 


(6) Direct Matrix Solution. The treatment follows the same lines 
as for the one-point boundary problem (see § 6-4). Thus commencing 
with the general solution (6-4-5) we construct equation (6-4-6) as 
before, and the additional relation 


t(D)x(t) = l(t)M(t)c+f(D)P(t), 


-(2) 


in which l(t) is a square matrix of order n, similar to l(t) but having for 
its sth column 

ftj ='KK+D)M}- 

Substituting t = t 0 in (6-4-6) and t = t x in (2) and adding, we obtain 
O m <f>(D 0 ) x(t 0 ) +i/r(D x ) x(t x ) 

- Wo) M(t 0 ) + l(t x ) M(t x )] c + $(D 0 ) P(t 0 ) +ijr{D x ) P(t x ). 

This yields 


c = L-W - P(t 0 ) +f(D x ) P«i)}, .(3) 

where L denotes the square matrix of order n 

L(t 0 ,t x )^l(t 0 )M(t 0 ) + l(t x )M(t x ). 

Substitution of the value for c given by (3) in (6-4-6) yields the 
solution 


x(t) - k(t) M(t) L~'{4> - <P(D 0 ) P(t 0 )-ir(D x ) P(t x )} + P(t) .(4) 
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Part II. Systems of First Order 


6*8. Preliminary Remarks. The case where the differential 
equations are of first order is of particular importance not only on 
account of its comparative simplicity but also because special methods 
can be used which are not directly applicable to systems of higher 
order. Further, as shown in § 5*5, a system of general order can always 
be converted into a system of first order by a suitable choice of new 
variables. 

The general first-order system in n dependent variables y v y 2i ...» y n 

will be written trrw i n \ /*\ /i\ 

f(D)y = (vD-u)y = r/(t) f .(1) 


where v and u are square matrices of order n having constant elements. 
It will be assumed throughout that the matrix v is not singular, and 
that consequently the determinants! equation A(A) = | vA — w | =0 
has n roots. If v were singular, say of rank r, then n—r of the equations 
(1) could be replaced by purely algebraic relations. These could be 
used to eliminate n — r of the variables y, and the system would then 
be reduced to one involving only r equations and having for the 
coefficient of D a non-singular matrix (compare § 5*5). 

In the special case where v is the unit matrix, equation (1) simplifies to 


f(D)y = (ID-u)y = r/(t). .(2) 

Such a system will be described as being of simple first-order form . 
A system of the type (1) is always reducible to the simple form by 
premultiplication of the equation by tr 1 . 

In equation (1) there are n dependent variables y, whereas in 
presenting the theory of differential equations of general order we 
assumed m variables x. It is useful to preserve this distinction in the 
notation because in actual applications it often happens that the 
first-order system for solution in the variables y is derived (by methods 
such as those described in §5-5) from a system of higher order but 
containing a fewer number m of variables x. The number of the new 
variables y will generally equal the degree n in A of the determinantal 
equation of the system of higher order. 

Attention will be restricted mainly to methods applicable to simple 
first-order systems. However, at the outset we shall apply the special 
method of solution given in § 6-5 to the general first-order system, and 
trace its connection with the method of Heaviside. 
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6*9. Special Solution of the General First-Order System, 
and its Connection with Heaviside’s Method. In the present 
application of (6-5-2) there are n dependent variables y and n roots A, 
(assumed all distinct); moreover, f(D) = vD—u. Hence, if the form 
(6-5-5) is adopted for the particular integral, the special solution is 


ft r(t — to) / /* t ^ 

y(t)= 2 -(ij— .( 1 ) 

r_1 A(A,) 1 Jt * > 

We shall now briefly relate this solution to that obtained by Heavi¬ 
side and others.* The first rule in the application of Heaviside’s 
method to the solution of the system (vD — u)y = rj is to treat D 
temporarily as a constant, and to solve for y v y 2 , etc., the purely 
algebraic system of equations 

(vD—u)y = vDy(t 0 ) + r/(t). 


When expressed by matrices the result of this preliminary operation 
is clearly 

•( 2 ) 


... DF(D) F(D) 
y A/n\ W+A/ni VVh 


A (D) 


A (D) 


The next step in the method is to reinstate the operational significance 
of D on the right-hand side of each of the equations contained in (2), 
and to expand the operators concerned in powers of D~ l or Q , which 
is now interpreted to mean integration with respect to t between the 
limits t 0 and t . To evaluate the results use is made of the identity 
(compare (5-9-4)) 


h(D) _ h(0 ) 
A(Z>)"A(0)‘ 


n 

2 


D 


HK) _ 

, (1) ' D- A.’ 

1 A f A(A T ) U ** 


.(3) 


in which h(D) is a typical polynomial in D of degree not exceeding 
that of A(D). 

So far as concerns its final outcome, this treatment is equivalent to 
a direct application to (2) of the two matrix expansions 


and 


F(D) F(0 ) £ ^(A,) D 

^) =A (°) 


DF(D) 

A(D) 


= 2 


F(K) 


i (1) 

1 A(A,) 


D 

■D-K’ 


* See, for example, Refs. 16 and 17. 
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6*9 


which are obvious deductions from (3).* When used in conjunction 
with (2), these identities yield 


y(i)=x ; 

r-1 




L4(0) 


n 

s 

r-1 


( 1 ) 

KMK) 




y(t). 


Noting that DftD—X,.), when applied as an operator to unity, yields 
e A ^-<» ) (see (6*11-5) with g = l),andusing(511-6),wederive the solution 

V(t) = 2 e - m -F(A r )vy(t 0 ) + P(t) .(4) 

,_1 A(A,) 

This agrees with (1), since in (4) the form of particular integral adopted 
is such that P(t 0 ) = 0. 


Example 


Special Solution of a System of Four Equations. Suppose the 
differential equations to be 


'3 

-2 

0 

-r 

Dy- 

" 33 

20 

18 

7" 

y = 

■-2" 

0 

2 

2 

i 


-24 

-16 

-9 

-9 


0 

1 

-2 

-3 

-2 


43 

28 

11 

20 


3 

.0 

1 

2 

i_ 


.-21 

-14 

-4 

-11. 


_-l_ 


with y = {1,0,0,0} at t 0 — 0. Here 


/(A) = 


3A-33, 

24, 

A-43, 

L 21, 


-2A-20, 

2A+16, 

-2A-28, 

A+14, 


-18, 

2A + 9, 
-3A-11, 
2A + 4, 


—A—7 1, 
A+9 
-2A-20 
A+ll 


and on reduction 

A(A) = A 4 —6A a + 4 = (A*—4)(A a —1). 

The roots are accordingly A = ± 2 and + 1, and these give 

(i) (i) (i) (i) 

A(2) = 12; A( —2) = —12; A(l) = -6; A(-l) = 6. 

The corresponding adjoint matrices are found to be 


F(2 ) = 12{1, 0, -1, -1}[1, 2, 0, -1], 
^(-2) = 12{1, -1,-1, — 1} [0, 1, 1, 1], 
f(l) = 6{1,1, -2, -2} [-3, -2, 10, 18], 
JP(-l) = 6{1, 1, -2, —3}[—3, -2, 9, 16]. 


* Note that with a first-order system the degree in D of DF(D) cannot ezoeed that of 
A(A). 
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6'JMrlO 

Since vy(t 0 ) = (3,0,1,0}, the part of the special solution (1) independent 
of 7 j(t) can be written down at once as 

3e*{l, 0, -1, -1} + e-*{ -1,1,1, 1}+ ef{ -1, -1, 2, 2}. 

J t e (3-A r X_ 1 

e-*ih}(t)dt = {-2, 0, 3, -. 

Hence, after a little reduction 
4 eV rt 

S IW-FiK) e-^(t)dt 
'-'AM Jo 

= (-e«+e»){l, 0,-1,-l}+(0-4e 8< -0-4e-“){-l, 1, 1, 1} 

+ (9e“-9e0{-l, -1, 2, 2} + (4-26e s< -4-26c- < ){l, 1, -2, -3}. 
The required complete solution is accordingly 
y(t) = (4e»-e 3< ){l, 0, -1, - l} + (0-6e-« + 0-4e ! “){-1, 1, 1, 1} 

+ ( —8e'+9c 3< ){—1, -1, 2, 2} + (-4-25e- , + 4-25e 3 0{l, 1, -2, -3}. 

.(6) 

6*10. Determinantal Equation, Adjoint Matrices, and 
Modal Columns for the Simple First-Order System. 

(a) Determinantal Equation. The determinantal equation corre¬ 
sponding to (6-8-2) is 

A(A) = | A/ -it | = A n +p 1 A n_1 +... +'p n - 1 X+p n = 0, 

which is the characteristic equation of the matrix u. Moreover, the 
n roots A x , A 2 ,..., A n are in this case the latent roots of u. Hence many 
of the results obtained in Chapter m bear intimately on the theory 
of the simple first-order system. 

(6) The Adjoint and Derived Matrices. The adjoint matrix corre¬ 
sponding to any simple latent root (say A x ) is given by (see (3-8-9)) 

F( A x ) - (-1 

Further, if A x = A 2 = ... = A s , then (see (3-8-10)) 

j| W = (- l)»-i>-i[/(A.)r^-V(A, +1 )/(A. +a ).../(A n ),.(1) 

where — 1. 

For general values of A, the adjoint matrix is given by the formulae 
(see (3-8-7) and (5-9-3)) 

F( A) = tt”- 1 + (A+p x ) u n ~ 2 +... + (A* 1 - 1 + Pi A n ~ a +... +p„_!) I 
» A(A)F(A,) 

“A a) ' 

1 (A-A r )A(A r ) 
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(c) Matrices U p (t, X 8 ) of §5-10(6). The formula (1) can be used to 
derive expressions for the matrices U 0 , U v U 2 , etc., which give the modal 
columns when repeated roots occur. If, as above, A a represents a root 
of multiplicity 8 (so that A x = A 2 = ... = A s ), then 

= [4-</(a.)+|/ 2 (a,)+...+(- i) p ^m)]t/(A.)r i, - i n, 

wher e n =/(A a+1 )/(A, +2 ).../(AJ 

and p — 1. 


(d) Modal Columns . The s modal columns appropriate to a set of 8 
roots equal to A s are given by any s linearly independent columns of 
the set of matrices U Q) U v ..., U 8 _ v The typical modal column satisfies 


the relation 


[/(D+A,)-«]£,(*) = 0, 


from which it follows that the modal matrix k(t) has the property 

—ff'+K 1 ) A = uk(t), 

where A is the diagonal matrix of the latent roots. The last equation 

can be written rfjrft) 

u = k(t)A .(2) 


In the particular case where the latent roots of u are all distinct, the 
modal matrix is independent of t, and (2) then gives the usual col- 
lineatory transformation u — AAA: -1 . 


6-11. General, Direct, and Special Solutions of the Simple 
First-Order System. 

(a) General Solution. When the expression on the right of (6-10-2) is 
substituted for u in (6-8-2) the resulting equation can be written briefly 

d f t -kAk-'y-f t k-'y = r,. 

On premultiplication by M( — t) k~ l this gives 

j t {M(-t)k~'y) = M(-t)k-h,, 
and direct integration yields the general solution 

y(t) = fc(t)Af(<)(c+J‘Af(-«)*ri(0??(0d«). .(1) 
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where c denotes a column of n arbitrary constants. It should be noted 
that the particular integral in (1) vanishes at t = < 0 . When the latent 
roots of u are all distinct an alternative form of the particular integral is 

P(t) = £ \'e-Vn(t)dt. .(2) 

(b) Direct Solution . The direct solution for general one-point and 
two-point boundary problems can be obtained from (6*4*10) and (6*7*4). 
When the values of y at t = t 0 are assigned (standard one-point boun¬ 
dary problem) a convenient form of the direct solution is (compare (1)) 

y(t) = k(t) M(t) (- 1 0 ) k-'(t 0 ) y(t 0 ) + k-'(t) 7](t) d<). 

.(3) 

If, further, the latent roots of u are all distinct, the modal matrix will 
be constant, and its reciprocal can readily be calculated by use of 
(3*8*12). 

(c) Special Solution (Standard One-Point Boundary Problem ). The 
special solution appropriate to the case of distinct latent roots is at 
once deducible from (6*9*1). The formula can be written 

y(t) = | eV'-W Z 0 (K) {y(g+*J. .(4) 

( 1 ) 

in which Z 0 ( A,.) = JF(A r )/A(A r ) (see also § 3*9). 

Example 

Direct Solution . If the equations (6*9*5) are premultiplied by 
= 1, 1,-2, -41, 

0, 1, 0,-1 
-1,-1, 3, 6 

2, 1, -6, -10_ 

the result is Dy— 7, 4, 3, 2 y = —4 e 3 *. 

-3,-2,-5, 2 1 

-6,-4, 0,-4 5 

— 6, —4, 1,-5J L-U 

The direct solution (3) of this simple fibrst-order system will now be 
obtained for the initial conditions y = {1,0 , 0 , 0} at t 0 — 0. 
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The latent roots of u in this case are A = ± 2, ± 1, and 


A(2) = 12; ^(2) = 12{1, 0, -1, -1}[3, 1, 2, 0]; 

U) 

A(l) - — 6; F( 1) = 6{1,1, - 2, - 2} [1, 0, 2, -1]; 

(i) 

A( —1) - 6; J’(-l) = 6{1, 1, -2, — 3} [0, 0, 1, -1]; 

(i) 

A( —2) = -12; F( — 2) = 12{1, -1, -1, —1>[1,1,1,0]. 
Hence k = f 1, 1, 1, 1"|, 

0 , 1 , 1,-1 
- 1 ,- 2 ,- 2,-1 
.-1,-2,-3,-1 

and by (3-8*12) hr 1 - [" 3, 1, 2, 0]. 

- 1 , 0 ,- 2 , 1 
0 , 0 , 1,-1 
'.- 1 ,- 1 ,- 1 , 0 

The term independent of rj(t) in (3) is thus 


1, 1, 1, n fe“ 0, 0, 0 


0 , 1 , 1,-1 0 , e *, 0 , 0 - 1 , 0 ,- 2 , 10 


1,-2,-2,-1 0, 0,e~*, 0 


3, 1, 2, 0"|fl 


0 , 0 , 1,-1 0 


L-i,_ 2,-3,-1JL0, 0, 0, e-aJL-l,-l,-l, Oj |_0j 

= r l, i, i, nr3e*"i. .(5) 

0, 1, 1,-1 -ef 

- 1 ,- 2 ,- 2,-1 0 

.-1,-2,-3, -lJL-« - *_ 

To evaluate the particular integral, we note firstly that 
k~hi(t) = {-1.-18, 17, -2}e» 

On premultiplication by M(—t) and integration 

f M(-t)k~hi(t)dt = f {-tf, -18e*, 17e", -2c“}df 
Jo Jo 

= {—e*+1, -9e»+9, 4-25e*-4-25, -0-4e«+0-4} 

Hence M(t) \ ~ e3l+e2i T 

J 0 I — Oc® 4- Qef I 


— 9e®+9e^ 
4-26e“-4-25e- < 
_0-4e a< +0-4«-*' 
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Addition of this column to the column postmultiplying k in (5) gives 
the complete solution 


y<*) = r 1, 1, 1, 1 

0 , 1 , 1,-1 

-1,-2,-2,-1 
-1,-2,-3,-1 
This agrees with (6-9-6). 


4e*-e* 

8c*—Qe®* 

— 4-26c _< +4'26e 8< 
-0-6e-«-0-4e“ 


6*12. Power Series Solution of Simple First-Order Systems. 

The great attractions of this method are its simplicity and the fact that 
it altogether avoids the generally troublesome problem of solution of 
the determinantal equation. On the other hand, in numerical applica¬ 
tions, the process often has the disadvantage of slow convergence. 

In § 2-7 it was shown that the matrix series 

2 

e vi =I + ut + -+ 

is absolutely and uniformly convergent, and that DeV* = ue"*. It im¬ 
mediately follows that the complementary function of the differential 
equations (6-8-2) is given by ^ . (1) 

where c is a column of n arbitrary constants. 

To obtain a particular integral premultiply (6-8-2) by e _u/ and write 
the result as D(e~ vi y) = A particular integral which vanishes 

at t = t 0 is accordingly 

P(t) = e-^ViQdt. 

This leads to the general solution 


= eU< ( c+ 




In a one-point boundary problem* where the values y(t 0 ) are assigned, 

we have n 

y(t) = fe^J .(3) 

In particular if rj{t) = e et p, the last equation yields 

y(t) = ePQ-ttyfc 0 ) + e et °(I n — e^ - ^) (dl n — u)~ x p. 

The following identity, obtained by comparison of (3) and (6*11-3), 
may be noted. expti ( ( _ <o ) = k(t)M(t-t 0 )k~ l (t 0 ). .(4) 


♦ The case of a.two-point boundary problem is dealt with in § 6*13. 
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Example 

Computation of a Series Solution. As a very simple illustration of 
the method of computation, we assume the system to be 



0, r 

- 100 , 0 . 


Vi 

lyd 


and take for initial conditions y x = 0 and y 2 = 10 at t 0 = 0. The exact 
solution in this case is evidently y x = sin 10< with y 2 = 10 cos 101: this 
will provide a comparison with the series solution. 

Applying (3), with 7/(t) = 0, we have 


where 


If < = 001, 


y(t) = e“*{0,10}, 

u= r o, ii. 
L—ioo, oj 


e o-oiu _ n,oi + r 0,0-on-f jr-o-oi, o i+iro, -o-oooii 
Lo.iJ L-i,o J [ o, — o-oij [o-oi, o J 

+^ro-oooi,o l+xkr o-ooooon-f... 

[o, 0-000lJ [-0-0001,0 J 

= r 0-9950042,0-00998331. 

[-0-9983341,0-9950042] 

Hence y(0-01) = e 001u {0,10} = {0-099833,9-950042}, 
y(0-02) = e oolu y(0-01) = {0-198669,9-800667}, 

and so on. The results of this iterative process may conveniently be 
summarised in tabular form as follows: 


X 

0 

0-01 

0-02 

0-03 

0-04 

0-05 

jMO 

y.(0 

0 

10 

0-099833 

9-950042 

0-198669 

9-800667 

0-295519 

9-553367 

0-389417 

9-210614 

0-479424 

8-775831 

t 

006 

0-07 

0-08 

0-09 

1 

y«(<) 

0-564041 

8-253363 

0-644216 

7-648430 

0-717354 

0-967077 

0-783325 

6-216112 

■ 


The results for t = 0-1 agree to five figures with the values for sin 1-0 
and 10 cos 1-0 as given by standard Tables. 

As an alternative, a smaller interval might have been used at the 
outset. For instance, if t = 0-001, 
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gO-OOlu _ 


ri, on 4 - r o, o-ooii+^r-o-oooi, 

[o,lj [-0-1,0 J L 0, 

+£ro, -o-oooooon+ ... 
[o-ooooi, o J 

r 0-9999500,0-00100001. 

[ - 0-0999983,0-999950oJ 


Forming the tenth power of this matrix, we obtain 


0 

0 - 0001 . 


e 001 “ = r 0-9950041, 0-00998351, 

[-0-9983339, 0-995004lJ 

which agrees well with the value obtained above. 

The choice of step is a matter for judgment, and will depend largely 
upon the magnitudes of the elements of the matrix u. 


6-13.* Power Series Solution of the Simple First-Order 
System for a Two-Point Boundary Problem. The solution of the 
simple first-order system in power series, given in §6-12, can be ex¬ 
tended to two-point boundary problems as follows. For simplicity 
assume the equations to be homogeneous ( t\ — 0) and let the assigned 
values be {y x (y, y 2 (t 0 ),..., y 8 (t 0 )} and , y s+2 (<i), • y n (h)}- Arrange 

the n dependent variables into the corresponding subsets 

■^s = {}Ji> •••)2/«}> 

■^n —8 = 2/n}> 

and express the matrix i n the partitioned form 

W, t 0 ) 9 fi(t , £ 0 ) 1 

Ly(Mo), S(t,t 0 ) J 

where a, /?, y, $ are, respectively, submatrices of types (s,s), (s,n —s), 
(n — 8,s), (n — 8,n — s). Then equation (6*12*3), with ij = 0, gives 

r^) i = wmo),amo)][W ]> .a) 

L^n-sWJ Lr(^ ^))> ^o)J L-^n—«(^o)J 

so that, in particular 

= V ^o) -^s(^o) + ^<>) -^n-s(^o)* 

This yields Y n _ 8 (t 0 ) = to) \Xn-8(h)“~Y(h> ^o).(^) 

which expresses the unknown set of values Y n _ s (t 0 ) in terms of the 
known values T^(^ 0 ) and F n _ s (^). The solution required is then given 
by (1) in conjunction with (2). 





CHAPTER VII 


NUMERICAL SOLUTIONS OF LINEAR ORDINARY DIFFER¬ 
ENTIAL EQUATIONS WITH VARIABLE COEFFICIENTS 

7*1. Range of the Chapter, The general theory of linear ordinary 
differential equations with variable coefficients covers an immense field 
which is quite beyond the scope of this book. In the present chapter 
our principal purpose will be to indicate how matrices can be applied 
usefully in the approximate solution of such differential equations. 

Amongst the various special methods discussed, those described in 
§ 7*9 and exemplified in § 7* 10 are particularly powerful: one important 
field of application is to problems in mechanics which involve the 
determination of natural frequencies. The method of mean coefficients, 
described in § 7-11, is somewhat laborious, but it leads to good approxi¬ 
mations even when the true solution is highly oscillatory. Examples 
of the use of this method are given in §§ 7*12-7-15. 

7*2. Existence Theorems and Singularities. Linear differ¬ 
ential equations with variable coefficients are rarely soluble by exact 
or elementary methods, and it is usually necessary to resort to numerical 
approximations. In a one-point boundary problem, for example, 
where the values of the dependent variable (or variables) and of the 
derivatives up to a certain order are specified at some datum point, say 
t = l 0 , the normal procedure is to try a development of the solution in 
the form of a series of ascending powers of t — t Q . It is obviously of great 
assistance to know beforehand whether such a form of solution is 
justified, and, if so, the range of convergence. This information is 
supplied by “existence theorems”, which specifically concern the 
conditions to be satisfied in order that solutions of differential equations 
may exist, and the ranges of validity of such.solutions.* 

In connection with any given linear differential equation, it is usual 
to describe as the singularities or the singular points those points for 
which the conditions necessary for the establishment of the relevant 
existence theorem are violated. All other points are generally referred 
to as ordinary points. 

* For a detailed discussion of these questions the reader should consult a standard work 
on differential equations, e.g. Ref. 5. For an elementary exposition Ref. 18 may be 
recommended. 
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Examples 

(i) Single Homogeneous Linear Differential Equation of Order n. 
Let the given equation be 

D n X + <j>\(t) D n ~~^X 4* • • • + 1(0 ^ + $rSf) ® ==: 0, 

where D=d/dt . 

Suppose firstly that we are concerned with real quantities only 
(equation in real domain). Then the existence theorem for this equation 
states that if the coefficients fc{t ) are continuous functions of t in the 
interval a ^ t < ft, and <f) 0 (t) does not vanish in that interval, a unique 
solution expressible in power series exists which is continuous in (a, 6) 

(1) (n-l) 

and which yields assigned values for x(t 0 ), x(t 0 ), ...,a;(£ 0 ) at any given 
point t 0 of (a, ft). If the coefficients are all finite, one-valued and con¬ 
tinuous throughout (a, ft), the only singular points which can occur 
in (a, ft) are the zeros of <f> 0 (t): all other points are ordinary. 

When the variables are complex, let t 0 be a point of the Argand 
diagram (J-plane), other than a zero of <f> 0 (t), in the neighbourhood of 
which all the coefficients </>$) are analytic* functions of t. Then a 
unique series solution in powers of t — 1 0 exists which satisfies the given 
conditions at t = t 0 . This series converges absolutely and uniformly at 
least within the circle having £ 0 for centre and passing through that 
singularity of the set of coefficients 0 , 0 > • <f>J<f> 0 which lies 

nearest to t 0 . 

(ii) The Standard System of First-Order Equations . If the system of 
equations Dy = u(t)y + rj(t) is in the real domain, and the elements 
u {j (t) and y^t) are continuous in the interval (a, 6), then a continuous 
solution y(t) exists which is unique in (a, ft) and which yields given 
values y(t 0 ) at a given point t 0 of the interval. Moreover, if the elements 
concerned are continuous for all positive and negative values of t (e.g. 
when Uy and rji are polynomials in t) y then the solution will be con¬ 
tinuous for all real values of t. 

Problems frequently arise in which the elements u if and ^ satisfy 
the condition of continuity in (a, ft) and are at the same time functions 
of a parameter, say K, real or complex. If K is complex, we shall 
assume it to be restricted to such a region R of the Argand diagram 
that Uy and rj i are analytic functions of K at each point of R. In this 

* That is, the coefficients are single-valued, continuous, and admit a unique derivative— 
namely, a derivative independent of the direction of approach. 
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case a unique solution y(t> K) exists which is continuous with respect 
to t and analytic in K. For example, if the coefficients are integral 
functions (or polynomials) of K , the solution will itself be integral in 
K and uniformly convergent for all values of K when a^t^b. This 
solution will accordingly be of the type 

y(t,K) = U 0 (t) + U 1 (t)K+U 2 (t)K*+... 9 

and if the assigned conditions at t = t 0 do not themselves involve K, 
the first term U 0 (t) of the series must alone satisfy the appropriate 
initial conditions, while the remaining terms U^t), U 2 (t ), etc. vanish at 
t = t 0 . This often provides a powerful method of solution. 


7*3. Fundamental Solutions of a Single Linear Homo¬ 
geneous Equation. Let the given equation be 

D n x -f- <f>i(t) D n 4" • •. 4* Dx 4- $ n (t) x = 0, 

and consider firstly the solution relative to an ordinary point, namely, 

(1) (n—1) 

a point t 0 other than a zero of (f> 0 (t ). If the values x(t 0 ) t x(t 0 ), ..., x(t 0 ) 
are assigned, the required solution may be taken as 

(1) (n-l) 

x(t) = X 0 (t) x(t 0 ) 4- -2Ti(£) x(t 0 ) 4-... 4- x(t 0 ) , 

in which X 0 (t), X 1 (<), ..., X n __ x (t) are n special solutions, known as the 
fundamental solutions. These solutions satisfy the n distinct sets of 
initial conditions 

(1) (2) (n-l) 

= *•(« = * 0 (W ^ •••> X Q (t 0 ) = 

(1) (2) (n-l) 

*^1(^0) = *i(*o) = L *i(*o) = •••> Xi(* 0 ) — 

and so on. The fundamental solution X r will be a series of the form 


X r (t) = 


(*-*o) r 

fc 


4 ~ CL r 


(<-<o) n 

Izl 


+ a r,n+1 


(t-tp ) n+1 
|^+ 1 


4 ".... 


The ordinary procedure is to substitute this series in the given equa¬ 
tion, to arrange in ascending powers of t — 1 0 , and to equate to zero the 
coefficients of the successive powers. The constants a rn> a r n+1 , etc. are 
then found from recurrence relations. 

In dealing with a singularity, it is usual as a preliminary step to 
transform the independent variable in such a way that the singularity 
is brought to the origin t = 0; for example, in the discussion of a 
singularity at infinity, the appropriate substitution is t = 1/r. This 
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procedure clearly involves no loss of generality. Proceeding then by 
the well-known method of Frobenius we assume a series solution 

x(t y p) = 2 

V-O 

in which the exponent p is not necessarily a positive integer. The trial 
solution is substituted in the differential equation, and in the first 
place the coefficient, say P(p), of the term of lowest degree in t is equated 
to zero. The nature of the singularity depends on the nature of the 
roots of the indicial equation 

P(p) = 0. 

If the indicial equation has n roots, the singularity is said to be regular . 
On the other hand, if P(p) is of degree less than n y or is independent of 
p 9 the singularity is classed as irregular . 

If, as will be assumed, the singularity is regular, it will be possible 
to write the differential equation in the form 

f(D) x = t n D n x + F-'P^t) D n ~ l x +... + tP n _ x (t) Dx + P n (t) x = 0 y 

where the functions P^t) are all finite, one-valued and continuous in an 
interval embracing t = 0 if t is real, or else are analytic near t = 0 if 
t is complex. 

In the case of a regular singularity, provided that no two of the n 
exponents differ by zero or an integer, there will be n fundamental 
solutions of the assumed type, and the coefficients of each series will 
be determinable directly by recurrence relations. If there are multiple 
roots, or roots differing by integers, there will still be n fundamental 
solutions, but some of them will involve logarithmic elements.* 

7*4. Systems of Simultaneous Linear Differential Equa¬ 
tions. Adopting a notation similar to that used in § 5*1, we may write 
a system of m linear differential equations of order N connecting the 
m dependent variables x l9 x 2 ,...,x m with t as 

(<t> 0 D» + <t> 1 D"-i + ... + <t> N _ 1 D + 'f> N )x = £(<). 

The coefficients <j) i are square matrices of order m, having for elements 
assigned functions of t. A system of the foregoing type is clearly 
reducible (in an indefinitely large number of ways) by methods similar 
to those illustrated in § 5*5 to a system of, say, n differential equations 
of the first order. In accordance with the notation of §6-8, the most 


* For a description of the treatment in these cases see § 16*3 of Ref. 5. 
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general system of »linear equations of the first order may be expressed 
“ (vD-u)y = V (t). .(1) 

Here v and u are square matrices of order n whose elements are given 
functions of t. When v — I n the system is referred to as simple. For 
ranges of t in which v is not singular (I) can be reduced to a simple 
system by premultiplication throughout by ir 1 . 


Example 

Reduction of a Single Linear Equation to a First-Order System. To 
illustrate the methods of reduction, consider a single linear equation 
of order n ^ t) ^ + ^ j)n-i x + ... +<f> n ( t ) x = £(<). 

Writing (as for example (i) of § 5-5) 

y =(x, Dx, •••, D n 

we derive the system of n first-order equations 

ri o ... o oiDy = r oi o ... o ly+roi. 


Lo 0 ... 0 L-<f> n -^ n _ 2 ... U(f)J 

For all ordinary points (i.e. for all points other than the zeros of <f> 0 (t)) 
this equation can be replaced by the simple system 

Dy= r o i o ... o -\y+r o -|. 


L ^n/00 “0n~l/0O “^n-2/00 ••• L£/0O-I 

The following is an alternative scheme of transformation which has 
been used by Baker* in relation to the Peano-Baker method of 
integration (see §7-5). Retaining his notation, we suppose the given 
(homogeneous) equation to be 

D n x = ^ D n ~ l x + -r^j- Df*-*x + ...+ . ^ P ° . X, 

Yn Yn-lYn Y1Y2 • • •Yn 

and put y x = z;y 2 = <p x Dx; y n = <j> x <(> 2 ... 


* See p. 343 of Ref. 6. 
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This transformation of the variables yields the first-order system 


Dy = 


o 

0 

0 


l/0i 

0 


0 

l/0 2 

H 9 


0 

0 

1/08 


0 

0 

0 


0 

0 

0 


-*n -2 l/0»-X 


0 0 0 0 ... H, 

LPoltn PJtn PSn PJ*n - P-i/0. + 


in which 


« ( 1 ) 

#s = 2 0r Mr- 
r—1 


( 2 ) 


(i) 


For instance, if (f> l = 0 2 =... — <p n = <t>, then = 8<f> /$, and (2) 
simplifies to 




0 10 

( 1 ) 

0 0 


1 


0 

0 


( 1 ) 


0 0 20 1 


0 

0 

0 


0 

0 

0 


0 


0 


( 1 ) 


L A 


(1) I 

1 x 2 x 3 ••• x n—2 P 71 -I +( n ~ J 


(n — 2)<f> 

Pi P2 A .. P„ 

An important special case is where <f> = t and the functions P n _u P n _ 2 , 
etc. are polynomials in t. The given differential equation then is 

t n D n x = + P n _ 2 t n ~ 2 D n ~ 2 x -f... + P 0 a;, 

and the equivalent first-order system is 


tDy = 


1 

1 

0 


0 

1 

2 


0 

0 

1 


0 

0 

0 


0 

0 

0 


y- 


0 0 0 0 ... n-2 1 

L-Po Pi P 2 ^3 *.. Pfi—2 Pn—i ^ i J 

7*5. The Peano-Baker Method of Integration. This method 
of integration of the simple first-order system of differential equations 
was introduced by Peano in 1888, but the more recent developments 
are due to Baker.* The principle of the method is extremely simple. 


* Historical notes are given in Ref. 19: Bee also Ref. 20 and § 16*5 of Ref. 5. 
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Considering firstly homogeneous systems, we suppose the given 
equations to be Dy = u(t)y, 

and assume a set of values for y to be assigned at t = t 0 . Now it has 
been proved in §2*11 that the matrizant £!{,(«) has the fundamental 
property DQ{,(u) = uQj t (u). 

Further, when t = t 0i &{ 9 (u) reduces to I n . Hence 

y(<) = Ql.(u)y(t 0 ) .(1) 

satisfies the given system of equations and yields the required values 
y(t 0 ) at t = t 0 . This is the Peano-Baker form of solution. A feature of 
the solution is the wide range of its validity when t is complex; it 
extends to all paths of integration which do not encounter any of the 
barriers referred to in §2*10. On the other hand its usefulness for 
practical computation is apt to be limited by slow convergence. 

The matrizant method of solution can readily be applied to the 
non -homogeneous system 

Dy = u{t)y + rj{t). 

For, when premultiplied throughout by 0~ 1 (i^), this equation can be 
expressed as D^^y) = Q~y 

Hence a particular integral which vanishes at t = t 0 is 
P(y) = n(u)Ql{a-\u) V (t)}. 

The solution required is accordingly 

y(t) = Q(u) y(t 0 ) + Q(u) Qj^Q-^u) y(t)j. 

By expression of the matrizant in the appropriate partitioned form 
and a treatment similar to that adopted in § 6*13, it is possible also to 
obtain a formal solution for the case of a two-point boundary problem. 
The detailed construction of these formulae may be left to the reader. 

7*6. Various Properties of the Matrizant. A useful property* 
of the matrizant is immediately deducible from equation (7*5-1). 
Suppose t x to be any point of the interval (t 0 , t). Then 

y(ti) = ty.(u)y(t 0 ) 

and similarly y(t) = £2f i («)y(< 1 ). 

It follows that 

* Equation (1) was originally given in Ref. 21. Equation (2) is due to Baker (Refs. 6 
and 20); and equation (3) is equivalent to one given by Darboux (Ref. 22). 
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More generally, if the complete interval (t 0i t) is divided into any 
number of smaller intervals ( t Q , t x ), (t v t 2 ), ( t 8 _ lf t ), then 

ty'(u) = £ 2 L••• QfcMG&tO.(1) 

A further property is expressed by the identity 

£l(u + v) = 0(w)£l(F), .(2) 

in which V = f2 _1 (u) vO(w). 


To prove this, consider the system of equations Dy = {u + v) y, and 
let the values y(t 0 ) be assigned. Then if y = C2j e (u) Y, the system reduces 
to BY = VY. Since y(t 0 ) = F(2 0 ), the solution of this last system of 
equations can be written Y(t) = O(F)y(£ 0 ), or y(t) = £l(u)£l(V)y(t 0 ). 
On comparison of this with the solution y(t) = 0(w + v)y(t 0 ) f obtained 
from the equations in their original form, the required result follows. 
Finally, it may be noted that 

log A = J* (u n + w 22 +...+ u nn ) dt, .(3) 

where A = | Q(u) |. The method of proof is general, but for brevity 
assume u to be a square matrix of order 2. By differentiation of A 
with respect to t , 


(1) 

A = 

(1) 

On 

&12 

+ 

On 

(i) 

O i2 


(1) 

Q 21 

o 22 


Q 2 i 

(i) 

^22 


(i) 

Since Q = uQ, the preceding equation can be written 
(1) 

A = “f ^12^21> ^12 I “t* ^11> ^11^12 ^12^22 

^21^11 “b ^22^21 > ^22 I I ^21> ^21^12 ^22^22 

= (^n + ^22) A. 

This yields on integration log A = I (u xl + u 22 ) dt. 

Ju 

7*7. A Continuation Formula. Suppose the solution of a 
system of linear differential equations to be required over the interval 
(t 0 ,t). If all points of the interval are ordinary, it is theoretically 
possible to obtain the solution by use of the fundamental series solu¬ 
tions relative to the initial point t 0 . On the other hand, for purposes of 
practical computation, it may sometimes be preferable to employ the 
series relative to t 0 only up to some intermediate point of the range, 
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say t v and to continue the solution thereafter by the use of series 
appropriate to the point t x or to some other point of the interval (t v t). 
More generally, if n — 1 successive intermediate points t l9 1 2 ,t n _ x are 
taken, so that the whole interval (t 09 t) is divided into n steps, it is 
possible to base the computation in a typical step on the series rela¬ 
tive to a suitable point of that step. The solution is carried over 
from step to step by identification of the initial conditions for any 
step with the end conditions for the preceding step. This fitting 
together of a sequence of solutions will be referred to as the method of 
continuation. We shall now obtain a matrix formula which expresses 
the process in a concise form. 

The treatment is applicable in principle to linear differential equa¬ 
tions of any order, but for simplicity we shall consider specifically the 
simple system of first-order homogeneous equations, 

Dy = u(t)y. .(1) 

A solution of the equations, valid in each step, will be assumed 
known. If T 8 denotes some chosen point of the 5th step (t 8 _ l9 t 8 ), 
then the solution appropriate to that step may be supposed ex¬ 
pressed either as a series of powers of t — T 8 , or in terms of the matrizant, 
or in any other convenient form. The solution to be used in the 5th 
step will be denoted by 

y(t) = H 8 (t)y(t 8 _ x ). .(2) 

Here H 8 is a square matrix of order n, which is assumed to reduce to the 
unit matrix I n when t = t 8 _ x . In the particular case where the point 
T 8 is chosen at the initial point t 8 _ x of the step, the yth column of 
H would represent the fundamental solution appropriate to the special 
set of conditions yj(t 8 _ i) = 1 and y v (t 8 _ x ) = 0 for v+j. On the other 
hand, if T 8 is situated elsewhere in the step, the columns of H 8 are 
suitable linear combinations of the fundamental solutions relative 
to T 8 . 

On application of (2) to the n assumed steps, we have the 
sequence of relations 

2 /($i) = y(t 0 ), 

8/(^2) = ^2(^2) 


y(^n-l) ~ *®n—1(^»—l) y(^n— 2 )* 
y(t) = H n (t)y(^). 
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or, say,* 

’ y(t) = h(H 8 )y(t 0 ). 

.(3) 


n 


The process of continuation adopted is thus represented by a chain of 
matrix multiplications. The reader should particularly note that the 
individual matrices of the chain will in general differ from step to 
step. 

In the construction of the formula (3) it has been supposed that no 
singularity is encountered in the interval of t under consideration, 
but this restriction is not essential. The point T 8 of the typical step can 
be chosen to be a singularity, provided the set (or sets) of solutions 
appropriate to that singularity are used. 

In applications, the method of computation will depend to some 
extent upon the problem. For example, if the values of y(t 0 ) are numer¬ 
ically assigned, the least laborious procedure is to compute the matrix 
chain U(H) from right to left; for then the product Hi(ti)y(t 0 ) can be 
calculated as a single column, which on premultiplication by H 2 (t 2 ) 
again yields a single column, and so on. Hence in this case multi¬ 
plications of square matrices can be avoided. On the other hand, if— 
as often happens—the solution is required for arbitrary values of 
y{t 0 ), the direct multiplications of the square matrices must be effected. 
If many matrices are involved, it is wise to compute subproducts in 
the manner described in example (vi) of § 1*5. This greatly facilitates 
the correction of errors, and is also valuable if a possible increase of 
the number of steps in certain parts of the range of integration is in 
view. 

Some simple examples of the continuation formula follow. 

Examples 

(i) Linear Equations with Constant Coefficients . Assume the equa¬ 
tions (1) to have constant coefficients, and choose T b = t 3 _ x in every 
step. Then H 8 (t) — exp u(t - (which is consistent with the con¬ 
dition // s (£*_i) = / n ). Hence the solution (3) is 

y(t) = exp u(t - t n _i) exp - t n _ 2 )... exp u(t x - 1 0 ) y(t 0 ). 

.(4) 

1 

* The notation II (H $ ) is used to imply a product taken in the order H n H n ^ x ... H v 
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If T 8 z=t 8 — t 8 _ x denotes the total difference of t in the 5th step, the 
formula may be written 

1 

y(t) = n (exp UT g ) y(t 0 ) 

n 

1 / \ 

or y(t) » n \I + «V + -jj 4 ■+ • • • J y(t 0 ). 

If the steps are all equal and very small, this leads to the theorem 


expw$ = lim (l + —) . 
»->«o\ n) 


(ii) Matrizant Continuation. Assume the equations to have vari¬ 
able coefficients and as before choose T 8 = t^ in every step. Then, 
adopting the matrizant form of solution, we have H 8 (t) = L with 
H 8 (t 8 -i) = I n . The formula corresponding to (3) is thus 

y(t) = £!!._,(») QfcljM... Q{ l ,(u)y(t 0 ). 

If the steps are so small that only first-order terms in r 8 = t 8 — t 8 _ x 
need be retained, then approximately 

= I+ J o n(t,_ x + t) dr +... = / + w(< 8 _i) r„. 

The solution is thus exhibited as the limiting value, when the number 
of steps is indefinitely increased, of the matrix product 

y(t) = n + 


7*8. Solution of the Homogeneous First-Order System of 
Equations in Power Series. Let the given system of equations be 

v(t)Dy = u(t)y, .(1) 

and assume that over the range (t 0 , t) of integration concerned the 
elements of the matrices v(t) and u(t), if not polynomials in t — t 0 =T, 
are at all events expansible in Taylor’s series, so that 

( 1 ) t 2 ( 2 ) 

V(t) = V(t 0 ) + TV(t 0 ) + V(t 0 ) + ..., 

( 1 ) t 2 ( 2 ) 

U(t) =U{t 0 ) + TU(t 0 ) + n U(t 0 ) + • • • • 

Unless the matrix v(t 0 ) is singular, these expansions may be written 
alternatively as , /. +V+ F,t* + ..., 1 

V - 1 (*o) u {t) — U 0 +U X T+ U 2 t 2 +..../ . 1 } 

in which V { and U i are matrices of assigned constants. 
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As a trial solution assume 

y(t) - S(t) y(t 0 ) = (/„4-A x r + A 2 r 2 +. ..)y(t 0 ) .(3) 

Here A v A 2 , etc. are square matrices of constants to be determined, 
and the values y(t 0 ) are regarded as arbitrarily assigned. On pre¬ 
multiplication of (1) by and substitution from (2) and (3), 

we obtain 

(I n + V 1 T + V 2 T 2 +...)(A 1 + 2A 2 T + 3A 3 T 2 +...)y(t 0 ) 

= (U 0 +U 1 r+U 2 T 2 +...)(I n + A 1 T + A 2 T 2 + ...)y(t 0 ). 

This must be identically satisfied for all sets of values y(t Q ). Equating 
to zero the total coefficients of the separate powers of r, we therefore 
derive a set of recurrence relations from which the matrices can 
be calculated. Thus 

A x = U 0> 

2A 2 = (U 0 -V 1 )A 1 + U v 

3^3 = (U 0 — 2V 1 )A 2 + (U 1 — V 2 )A 1 + U 2 , 

and so on. 

If v(t) = I n , so that (1) is a simple system, there is the simplification 
I^ = ^ = Fj=...= 0. The recurrence relations then give 

A x = U 0 , 

2A 2 = U 0 A 1 + U 1 =U 2 + U li 

3A Z = U 0 A t +U x A x + U 9 = Ul + U^ + U^ + Ut, 

and so on. 

It should be noted that the columns of the matrix S(r) give the 
fundamental solutions relative to the point t 0 . The jth column of that 
matrix will be the fundamental solution appropriate to the special set 
of initial conditions yj(t 0 ) — 1 and y v (t 0 ) = 0 for v=\~j. 

Example 

Solution of a Single Equation. Consider the general solution of the 
equation D 2 x + tx = 0 in the vicinity of t = 0. If x = y v and Dx = y 2i 
the equivalent first-order system is 

Dy = (t7 0 + Z7i*)y, 


‘0 1"| and 

•S3 

II 

O 

o 

0 0. 

L-i oj 


where 
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Hence 

A -^[2 !]• 
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and so on. 

This yields 
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Lo c 
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The fundamental solution appropriate, for instance, to the initial 
conditions ^(0) = 1 and t/ 2 (0) = 0 is thus 


_ t z 42 ® 28 1 9 _ _ t z t 8 t 9 

Vx ~ l || + (6 (9 + 6 + 180 12960 + " m 

t 2 4< 6 28 t a 280 < u _ t 2 < 6 < 8 t 11 

~ “|2 + ]5 j 8 ~ + [11 ~ 2 + 30 ~ 1440 + 142560 “ 

7*9. Collocation and Galerkin’s Method. These methods of 
approximation* are powerful and relatively simple, and they are 
particularly valuable for problems involving the determination of 
characteristic numbers (see §6-2). 

A single differential equation of order n without variable parameters 
will first be considered, namely 

f(D)x-m = 0 , .( 1 ) 

in which f(D) == D n + $ x {t) D*- 1 +... + <j> n {t). 

It will be supposed that an approximate solution of (1) in the interval 
t as 0 to t = T is required. The boundary conditions must be such as 
to render the solution unique, and they may be assumed to consist of 
n non-homogeneous linear equations connecting the values of x> Dx , 

* For a fuller discussion of the methods, see Refs. 23 and 24. The original papers by 
Galerkin describing his method are not readily accessible, but brief accounts are contained 
in various Russian publications, e.g. Refs. 25 and 26. 
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D*x, etc. appropriate to, say, p specified points of the interval (0, T). 
These jp-point boundary conditions are thus of the type 

B t = 6* .(2) 


for t=l,2, where the expressions B i are linear and homo¬ 
geneous in the values of x, Dx , D*x, etc. corresponding to the boundary 
stations, and the constants b i are not all zero. 

The 5th approximation to the solution, say x , is assumed to have 

the form x = x Q (t) + X 1 (t)c 1 +X i (t)c 2 +... + X s (t)c a , 

where X 0 is any function of t which satisfies the conditions (2), X v X 2i 
etc. are any convenient functions each of which satisfies the simpler 
conditions B i = 0 for i= 1, 2, ...,w, and c v c 2 , etc. are constants left 
free for choice. When the functions X are chosen in this way, x neces¬ 
sarily satisfies the conditions (2) for all values of the constants c. The 
methods of approximation to be described only differ in the way in 
which these constants are determined. The approximate representation 
of the solution can be written concisely as 

a? = X 0 + Xc, .(3) 

where X denotes the row of the s functions X l9 X 2 ,..., X 8 , and c denotes 
the columns of the s constants c l9 c 2 , ..., c 3 . 

Let e(t) denote the function obtained when the expression (3) is 
substituted for x in the left-hand side of (1), so that 

e(t) mf(D) X(t) c +f(D) X Q (t) - g(l). 

The quantity e(t) is the error in the differential equation due to the 
approximation, and for a good approximation this error should be 
small throughout the interval (0, T ). 

In the first and more obvious method, which will for convenience 
be termed collocation , the constants c are determined to give zero error 
at 8 selected points t v t 2 , ..., t 8 of the interval. If Xj(t) and 

04^1 Y*(h) - Y,(h) 

■^ 1 (^ 2 ) Y 2 (t 2 ) ... Y 8 (t 2 ) 

JSs) ym ... Y g (t t ) 

the 8 equations for the constants in this case are given by 

0c = {m-Yom. 


( 4 ) 
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In an alternative method, due to Galerkin, the constants c 
are chosen so that s distinct weighted means of the error, namely 

J T 

e(t)Xj(t)dt, for j= 1, 2, a, are zero. The equations for the con¬ 

stants may then be expressed as 

(QoX'f(D)X)c = Q$X'(Z-f(D)X 0 ) .(5) 

It can be shown that as s is increased the values of the constants c 
determined by equations (4) and (5) tend to equality.* Hence, when 
(as normally) the sequence of representations x obtained by either 
method converges, then also the sequence given by the other method 
converges to the same limit. From the computational standpoint the 
method of collocation has the advantage of great simplicity, since it 
avoids the labour of evaluation of integrals: on the other hand, 
Galerkin’s method is generally the more rapidly convergent. 

If the differential equation contains a variable parameter and is 
homogeneous, and if in addition the boundary conditions are homo¬ 
geneous, so that b i = 0 in (2) for i= 1, 2,..., n , there may be charac¬ 
teristic values of the parameter for which solutions of the equation 
exist other than y = 0. With such problems the function X Q will be 
absent from (3), and £(£) = 0. The terms on the right of (4) and (5) 
accordingly vanish, and the approximations to the characteristic 
numbers are given in each case by the condition of compatibility of s 
homogeneous linear algebraic equations. It may be noted that, in 
exceptional cases, even when the true characteristic numbers are 
wholly real, the earlier approximations to these numbers may be 
complex. However, if the representations of the solution are con¬ 
vergent, the imaginary parts in the approximation will either vanish 
or tend to zero as 8 is increased. 

To illustrate the application of the methods to problems involving 
characteristic numbers, consider the single homogeneous differential 
equation of order n, x + Kft(D) x = 0, .(6) 

where K denotes the variable parameter. The form of approximate 
solution assumed in this case is x = Xc y and each of the s functions in 
X is chosen to satisfy the n homogeneous boundary conditions B = 0. 
The Galerkin equations, for instance, then are 

(Q$X'f(D) X) c + K(QZX‘xlr(D) X) c - 0, 

(v + Kw)c = 0, 

* for a formal proof, see Ref. 23. 


or, say, 
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in which v, w are square matrices of order 8. Accordingly, the set of 
permissible values K v K 2 ,..., K 8 given by the assumed approximation 
are the latent roots of the matrix u , where 

u = — vrH. .(7) 

The methods can also be applied for the approximate solution of 
systems of linear ordinary (or even partial) differential equations. 
In such applications an approximate representation of the type 
X = X 0 + Xc , involving s constants c, is assumed for each of the m 
dependent variables. The leading term X 0 in each case must be a 
function of the independent variable (or variables), chosen to satisfy 
the boundary conditions B — b completely, and the remaining func¬ 
tions X must satisfy B = 0. In the collocation method the complete 
domain of integration (assumed finite) is divided into s regions 
a v a 2 , cc 3f and a convenient point in each region is adopted for 
collocation. The assumed expressions for the dependent variable are 
then substituted in the differential equations, and the errors in the 
differential equations are made zero at each of the collocation points. 
The final outcome is a system of ms simultaneous linear algebraic 
equations for the m sets of s unknown constants c. If Galerkin’s method 
is employed, the error corresponding to the ith differential equation 
is multiplied in turn by the functions X appropriate to the ith depen¬ 
dent variable, and the integrals of these products, taken over the 
domain of integration, are equated to zero. This process again yields 
ms algebraic equations for the unknown constants. 

Consider, for example, the system of m homogeneous second-order 
ordinary differential equations f(D) x -f Kx = 0, where f(D) is now 
assumed to be a square matrix of order m, the elements of which are 
quadratic functions of D with variable coefficients. The scalar para¬ 
meter K will be determined so that the m variables x i vanish at both 
t = 0 and t — T : this defines a set of boundary conditions appropriate 
to a second-order system. In this case the approximate form of 
solution adopted can be expressed by partitioned matrices as 



where X i now denotes a row of s linearly independent functions each 
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of which vanishes at t = 0 and t = T, and c i is a column of 8 arbitrary 
constants. Now the column of the errors in this case is 


{e,} = /(D) Xc + KXc, 
and Galerkin’s equations may be expressed as 

(Q$X'f(D)X)c + K(Q$X'X)c = 0, 

or say (V -f KW) c = 0, in which F, W are square matrices of order ms. 
Hence the numbers K are the latent roots of — W~W. 

It should be noted that the approximations given by collocation 
will (normally) be unaffected by multiplication of the differential 
equations by arbitrary factors, but that this is not true for the 
Galerkin method. However, in mechanical problems, there is an 
optimum way of applying Galerkin’s method which renders it equi¬ 
valent to the use of Lagrange’s equations (see Ref. 24). 


7*10. Examples of Numerical Solution by Collocation and 
Galerkin’s Method. The examples will be restricted to simple differ¬ 
ential equations with known exact solutions, so that in each case tests 
of the accuracy of the approximate solutions will be possible. 

(i) First-Order Equation with Constant Coefficients . Suppose the 
differential equation to be (D— 1)# = 0 with the boundary condition 
x = 1 at t = 0. The exact solution is x = exp t . 

For the approximate solution in the interval (0,1) assume 

X = 1 +c 1 t + c 2 t 2 +...+c 8 t 8 , 

so that Xj = P and Y i = f(D) X j —jp- l — P. Hence, if the method of 
collocation is used, the equations for the constants are (see 7*9*4) 

©c = {-= {1}, 

where = jt{~ x — 1{. If a — 4, and if the points chosen for collo¬ 
cation are t x = 0, < 2 = < 3 = f, < 4 = 1, then 

‘too o i [Vi = m 
I U H ^2 1 

i I If It c 3 1 

0 1 2 3 J l_cj |_1- 

These yield c x = 1, c 2 = 0-5078, c 3 = 0-1406, c 4 = 0-0703. 

The corresponding equations by Galerkin’s method are (see 7-9-6) 

Qh{t,t\ t s ,[1 -t,2t-Zt 2 <*]c = Ql{t,t 2 ,t\t*}, 



7-10 

and these reduce to 


galerkin’s method 


229 


r i 
¥ 

5 

T2 

11 

3TF 

19"! 

TD 


V 

= 

T 

l 

3 

13 

11 




1 

TS 

TO’ 

To 

2T 


C 2 



1 

20 

7 

To 

5 

TT 

25 

56 


C 3 


1 

1 

4 

17 

7 




1 

LTQ 

21 

56 

itJ 




L5-J 


giving c x = 0-9975, c 2 = 0-5149, c 3 = 0-1325, c 4 = 0-0732. 

The two different approximate solutions, and the approximation 
given by the terms in Taylor’s series for exp t up to £ 4 , are compared 
below. The results given by Taylor’s series are the least accurate. 


Fourth Approximations to True Solution 
Collocation: 1 + 1-0000* + 0-5078* 2 + 0-1406* 3 4- 0-0703* 4 

Galerkin’s method: 1 + 0-9975* + 0-5149* 2 + 0-1325* 3 + 0-0732* 4 
Taylor’s Series: 1 + 1-0000* + 0 - 5000 * 2 + 0-1667* 3 + 0-0417* 4 


Comparison of Numerical Values 


t 

0 

0*1 

0*2 

0*3 

0*4 

0*5 

0*6 

0*7 

0*8 

0-9 

1*0 

exp t 

1*0 

1*105 

1*221 

1-350 

1-492 

1-649 

1*822 

2-014 

2*226 

2*460 

2*718 

Collocation 

10 

1T05 

1*221 

1*350 

1*492 

1*649 1 

1-822 

2*014 

2*226 

2*460 

2-719 

Galerkin 

10 

1*105 

1*221 

1*350 

1*492 

1*649 

1*822 

2*014 

2 225 

2*459 

2*718 

Taylor 

1*0 

1*105 

1*221 

1 350 

1-492 

1*648 

1*821 

2*012 

2*222 

2454 

2*708 


(ii) Perfect Flow Across an Annulus. A simple problem of perfect 
fluid flow in two dimensions will next be considered. The space between 
two concentric circular cylinders is assumed filled with fluid: the inner 
boundary r = 1 is at rest, while the outer boundary r = 2 is (instan¬ 
taneously) moving with velocity U in the direction of the coordinate 
axis OX. If ijr denotes the stream-function, the mathematical problem 

d 2 f d 2 \[r 


is defined by the equation + 


= 0, in conjunction with the 


boundary conditions rjr — 0 at r = 1 and rjr = —Uy at r = 2. 

For the approximate treatment it will be convenient to write 
p = logr and a = log 2, and to express the differential equation as 


d 2 ilr d 2 \Ir __ . 

+ = 0. Hence if we write — w U 

dp 2 do 2 

tion of p to be found, we require 


jRsinfl, where R is a func- 
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and the conditions to be satisfied are R = 0 at p = 0 and R = 2 at 
p = a. The exact solution is R = f sinh /?, and this will be compared 
with approximate results given by collocation and by the Galerkin 
method. 

For the approximate solution R we shall assume 
2 

R = -P+P(P-&)( C 1+W + c 3 p 2 ). 

Then, if the collocation method is applied, and the three positions for 
collocation are chosen to be p = 0, p = a/2 and p = a, the values of 
the constants are readily found to be 

c x = 0-314268, c 2 = 0-453391, c 3 = 0-039681. 

The corresponding values given by Galerkin’s method work out as 
c x = 0-316111, c 2 = 0-448040, c 3 = 0-039563. 

The two approximations are compared below with the exact 
solution R = f sinh/?. 


p 

0 

01 

0-2 

0-3 

0-4 

0-5 

0-6 

0-69315 

Exact Solution 

0 

0-26712 

0-63691 

0-81205 

1-09533 

1-38960 

1-69774 


Collocation 

0 

0-26718 

0-53698 

0-81208 

1-09529 

1-38949 

1-69766 


Galerkin 

0 

0-26711 

0-53691 

0-81206 

1-09533 

1-38958 

1-69775 



(iii) Symmetrical Vibrations of an Annular Membrane . The methods 
will next be applied to find rough values for the frequencies of vibration 
in the fundamental mode and the first and second overtones in the 
case of a uniform membrane bounded by the concentric circles r = 1 
and r = 2. 

Let T denote the tension per unit length, m the mass per unit area, 
and z the normal displacement at radius r and time t. Then the 
equation of vibration is 

d*z__T_!dfz ldz\ 
dt 2 m\dr 2 + rdr/' 

Assume a normal mode of vibration to be z = jRsin wt, and write 
K = (ohn/T ; then (see equation (7-9-6)) 

f(D)B + mD)E^+^ + KR = 0 , .( 1 ) 

and K is to be determined by the condition of compatibility of the 
solution of (1) with the boundary conditions R = 0 at r = 1 and r = 2. 
The exact general solution in terms of Bessel functions is 

B = AJ 0 (rjK) + BY 0 (r<jK), 
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and in order that the boundary conditions may be satisfied K must be 
a root of the equation 

J 0 (slK) F 0 (2 y/K) - J 0 (2 JK) Y 0 (jK) = 0. 

The three lowest roots are found from tables of Bessel functions to be 
K x = 9*753,^2 = 39-35, jST 3 = 88-72, corresponding to the fundamental 
and the first and second overtones. 

For the approximations, the form of solution assumed is 

R = (r— 1) (r — 2) (c 1 + c 2 r + ... -f c g r® _1 ), 

and it should be noted that whereas this is a polynomial expression 
in r, the exact solution involves Y 0 (r) and therefore the logarithmic 
element logr which itself is not expansible by a Taylor’s series in r. 
In the case of the collocation method the positions for collocation are 
chosen to be equally spaced in the interval (1,2) and to include the 
extremes, so that for instance r x = 1, r 2 = 1-5, r 3 = 2 when s — 3. 
When Galerkin’s method is used the equations can be treated con¬ 
veniently as explained for (7-9-6). Some of the results obtained by the 
two methods compare as follows: 

Characteristic Numbers for Vibrating Membrane 


Method 


K t 


Exact Solution 

9-753 

39-35 

88-72 

Collocation 

5=3 

9-51 

_ 

_ 


5=4 

9-721 

36-1 

— 

Galerkin 

5 = 2 

9-87 

41-6 

_ 


5=3 

9-752 

41-79 

101-5 


It was pointed out in §7*9 that the approximations given by the 
Galerkin method will be affected to some extent if the differential 
equation is multiplied throughout by an arbitrary function. To 
illustrate this, a few additional results obtained by Galerkin’s method 
may be cited. A different representation is taken, with s = 2, namely 
R = c x (r — 1) (r — 2) -|- c 2 (r 2 — 1) (r 2 — 4), 
and the differential equation (1) is assumed to be multiplied throughout 
by a function/(r). The results for different functions/(r) are as follows: 
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7*11. The Method of Mean Coefficients. The method to be 
described is applicable in principle to systems of equations of any 
order, but for simplicity it will here be assumed that the equations 
have been reduced to the simple first-order form 

Dy = u(t)y. 

An approximate solution will be obtained valid for the range (t 0 , t) 
and for an arbitrary set of initial values y(t 0 ). 

If t v t 2 , denote any n—1 successive intermediate points of 
the range (< 0 , <), then the exact solution is (see example (ii) of § 7*7) 

y(t) = QL.(«)*te(*) ■•■fy( u )fy( u )2/( t o)- 
As an approximation we shall now substitute for each variable element 
of u in the typical interval T p — t p — t p _ x an average value taken over 
that interval, and determine the matrizant on the assumption 
that the elements of u have these constant values. Thus let U(p) 
denote the matrix of these average values in the pth interval; then 
since U(p) is a matrix of constants (see example, § 2*11) 

= expC r (p)r p . 

Hence the complete approximate solution is 

y(t) = [exp(<—< n _,) U(n)] [exp T n _ x C/(n -1)]... [exp f/( 1)] y(t 0 ). 
For conciseness we shall write this as 

y{t) = E n (t -«„_ x ) E n _ 2 {r n _ 2 )... E^y^) .(1) 

The choice of the average values for the elements of u in any interval 
can to some extent be left to the judgment of the computer. If the 
intervals are small, sufficiently accurate values may often be found 
by inspection. In general, however, a good average value for u if in 
the interval {t p _ l2 t p ) is provided by the arithmetic mean 

1 C 

Uij = 7 7 I U ij{t) dt. 

c p c p-i J tp-i 

The integration may be carried out analytically or by Simpson’s rule, 
according to convenience. 

The individual matrices of the product (1) may themselves be 
evaluated approximately by expansion in power series (see example, 
§6-12), or they may be computed exactly by the use of Sylvester’s 
theorem or by (6-12-4). For instance, if the latent roots of the typical 
matrix U are all distinct, then 

n p\ r r 

exp tU = JcM(t) k- 1 = 2 (1) E{\ r ). 

,=1 A(Ar) 
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As a simple example suppose the system to consist of only two 
equations, so that 

U=\U n U lt \. 

[u n uj 

If the latent roots of U are complex, say X 1 = /i + io) and A 2 = /£ — ia), 
then (see §6-10(6)) 

„ E7 — A 2 1 . U-XJ . 

E = exp tU = -3 -e A i T + -r-— eA * T > 

Ai — A 2 A 2 — A x 

which reduces to 

E = cos o/r + ~~~~~~ sino>rj. .(2) 

The case of real roots can be dealt with by writing i(o for 0 ) in the 
preceding formulae. 

The numerical solution of differential equations by the method of 
mean coefficients is illustrated in some detail in §§7*12-7-15.* The 
equations chosen are also soluble in terms of known functions, so that 
the accuracy of the approximate solution can in each case be verified. 
Some equations with highly oscillatory solutions are included, and 
these provide a particularly severe test of the method. 

7’12. Solution by Mean Coefficients: Example No. 1. The 

equation is assumed to be 

D 2 x+ (167r 2 e -2/ — \)x = 0, .(1) 

and a rough solution is required for the interval t = 0 to t = 4, the 
assigned conditions being x = 1-0 and Dx = 0-5 at t = 0. 

In this first example the successive stages in the work will be ex¬ 
plained in some detail. 

Write x = y x and Dx — y 2 \ then (1) is equivalent to the simple 
first-order system 

Dy= ro il [yi\=uy> 

\u 2 i OJ [t/ 2 J 

in which u 21 = | —16 7 r 2 e _2/ . Here u 21 is the only variable element of u , 
and its mean values in the successive steps will be obtained from the 
formula 

1 ft* 

U 21 (p) = U 21 dt. 

1 p "p—l J tp-x 


* The examples are taken, in revised form, from Ref. 21. 
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For the step t p ), the equation to be used is 

By = [ 0 llj/= U(p)y, 

[u n {p) oj 

the solution of which is (see (7-1M)) 

y(t) = .(2) 

The exponential can be evaluated easily since the latent roots 
/i ± id) of U(p) are ±<JU 21 (p), and are either both purely imaginary or 
both real according as U 21 (p) < 0 or >0. Hence in the formula (7-11-2) 
we write p = 0, and id) =*JU 21 (p), so that oj may be either real or 
imaginary. The solution (2) thus reduces to 

y(0= cos co(t ^amd)(t-t p _i) y(t p _ x ). 

- o) sin d)(t — ^_x), cos o)(t — t p _ t ) 

The solution for the first step (0, t x ) is y(t) = E x (t) y( 0), and the terminal 
conditions, which are the initial conditions for the second step, are 
therefore y(t x ) = E^) y( 0). Hence in the second step the solution 
is y(t) = E 2 (t — t 1 )y(t 1 ) = E 2 (t — t 1 )E 1 (t 1 )y(0). Proceeding in this way, 
we find for the complete solution in the nth step 

y(t) = E n (t — * n _x) -®n—1 (^»—i ~ ^n-2) • • • E 2 (t 2 — #i(*x) y(0). 

The computations are summarised in Tables 7* 12*1 and 7*12-2. 
The choice of the steps is a matter for judgment: obviously, the 
more rapid the rate of variation of the coefficients, the closer should 
be the spacing. Steps of 0*1 have been adopted from t = 0 to t = 1, 
steps of 0*2 from t = 1 to t = 2, and steps of 0*5 from t = 2 to t = 4. 
Table 7*12*2 gives the matrices E and the initial and terminal values 
of y appropriate to the successive steps. For the first step, the initial 
conditions are y = {1*0,0*5}. The initial conditions for the second step 
(i.e. the values of y x and y 2 o.tt — 0*1) are found by computing the 
product 

r 0*36673 0*077831 Tl-01 = T 0*40561, 

[-11*1202 0-36673J |o*5j L-10-9368J 

and so on. 

The computations actually given yield the values of y x and y 2 
(i.e. the original x and Dx) at the end-points of the steps. If inter- 
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mediate values for the pth step are required, these can be calculated 
from (2), y{t p ^ x ) being now known. 

The exact solution of (1) corresponding to the assigned conditions 
at t — 0 is x __ j /2 C0Q {± ner iy 

The graph of this function is shown in Fig. 7*12*1 and the isolated 
points marked are the approximate values of y ± ( = x) as computed in 
Table 7*12*2 (i.e. the upper figures in the third column). The accuracy, 
even in the present rough application, is seen to be very satisfactory. 



Table 7*12*1. Example No. 1 
Computation of Elements of Matrices E p 


Step 

U tl (p) 

60 

cos cot 

sin an 

1 . 

— sm an 

O) 

— co sin an 

0 -01 

-142-88 

11-953 

0-36673 

0-93033 

0-07783 

-11-1202 

0* 1-0*2 

-116-93 

10-813 

0-47015 

0-88259 

0-08162 

-9-5438 

0-2-0-3 

- 95-69 

9-782 

0-55851 

0-82950 

0-08480 

- 8-1142 

0-3-0-4 

-78-30 

8-849 

0-63339 

0-77383 

0-08745 

-6-8473 

0-4-0-5 

-64-06 

8-004 

0-69644 

0-71762 

0-08966 

-5-7437 

0-5-0-6 

-52-41 

7-239 

0-74922 

0-66231 

0-09149 

-4-7945 

0-6-0-7 

-42-86 

6-547 

0-79326 

0-60889 

0-09301 

- 3-9862 

0-7-0-8 

-35-044 

5-920 

0-82983 

0-55800 

0-09426 

-3-3032 

0-8-0-9 

-28-646 

5-352 

0-86016 

0-51003 

0-09529 

- 2-7298 

0-9-10 

-23-409 

4-838 

0-88523 

0-46517 

0-09614 

-2-2506 

10-1-2 

-17-368 

4-167 

0-67229 

0-74029 

0-17763 

-30851 

1-2-1-4 

-11-553 

3-399 

0-77770 

0-62864 

0-18495 

-2-1368 

1*4-1-6 

-7-665 

2-7685 

0-85058 

0-52584 

0-18994 

-1-4558 

1 *6-1-8 

-6-055 

2-2484 

0-90059 

0-43467 

0-19332 

-0-97727 

1-8-20 

-3-306 

1-8183 

0-93460 

0-35570 

0-19562 

-0-64677 

20-2-5 

-1-5783 

1-2563 

0-80911 

0-58765 

0-46776 

-0-73828 

2-5-30 

-0-4225 

0-6500 

0-94764 

0-31933 

0-49125 

-0-20758 

30-3-6 

+ 0 0025 

- 0*0505i 

1-00032 

-002525t 

0-50000 

+0 00127 

3-5-40 

+0-1589 

-0-3986i 

1-01993 

-0-20063i 

0-50334 

+ 0-07997 
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Table 7-12-2. Example No. 1 
Computation of the Matrix Chain EpEp_ Y E p _i ... E x y{ 0) 


Step 

Matrix E v {r p ) 

Column yitf-i) 

00-01 

0*36673 

0*07783 

1*0 


-11*1202 

0*36673 

0*5 

0* 1-0*2 

0*47015 

0*08162 

0*4056 


-9*5438 

0-47015 

-10-9368 

0-2-0-3 

0*55851 

0-08480 

-0*7019 


-8*1142 

0*55851 

-9*0134 

0-3-0-4 

0*63339 

0*08745 

-1*1564 


-6*8473 

0*63339 

0*6617 

0*4-0*5 

0*69644 

0*08966 

-0*6746 


-5*7437 

0*69644 

8*3373 

0-6-0-6 

0*74922 

0*09149 

0*2777 


-4-7945 

0*74922 

9*6811 

0*6-0*7 

0*79326 

0*09301 

1*0938 


-3*9862 

0*79326 

5*9218 

0-7-0-8 

0*82983 

0*09426 

1*4184 


-3*3032 

0*82983 

0*3374 

0-8-0-9 

0*86016 

0*09529 

1*2088 


-2-7298 

0*86016 

-4*4053 

0*0-1 0 

0*88523 

0*09614 

0*6200 


- 2*2506 

0*88523 

-7*0890 

1 *0-1*2 

0*67229 

0*17763 

-0-1327 


-3*0851 

0*67229 

-7*6708 

10-1*4 

0*77770 

0*18495 

-1*4518 


-2*1368 

0*77770 

-4*7476 

1 *4-1*6 

0*85058 

0*18994 

- 2 0071 


-1*4558 

0*85058 

-0*5900 

1 *6-1*8 

0*90059 

0*19332 

-1*8193 


-0*97727 

0*90059 

2*4201 

1*8-20 

0*93460 

0*19562 

-1*1706 


-0*64677 

0*93460 

3*9575 

20-2*6 

0*80911 

0-46776 

-0*3199 


-0*73828 

0-80911 

4*4558 

2*6-30 

0*94764 

0*49125 

1*8254 


-0*20758 

0*94764 

3*8414 

30-3*6 

1*00032 

0*50000 

3*6169 


0*00127 

1*00032 

3*2613 

3-6-40 

1*01993 

0*50334 

6*2487 


0*07997 

1*01993 

3*2670 

_ 

___ 

— 

6*9977 


— 

— 

3-7518 
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7*13. Example No. 2. Assume the equation to be 

D 2 x + tx~ 0, .(1) 


and let the solution be required for the interval t = 0 to t = 8-4, the 
conditions at t = 0 being arbitrary. 

The given equation is equivalent to the first-order system 

Dy = r 0 11 y = uy 9 

L-* oj 

where y = {x, Dx} . In this case the series solution already computed 
in the example, § 7-8, will be used from t — 0 to t = 1-8. From t = 1*8 
to t = 8*4 the method of mean coefficients will be applied. 

Table 7* 13-1, which gives details of the computation of the elements 
of the matrices E py is similar to Table 7-12-1, but in Table 7-13-2 the 
arrangement differs somewhat from that of Table 7-12-2. Since the 
initial conditions are here not numerically assigned, the continued 
product of the square matrices E p is computed, and the result is 
multiplied by the (arbitrary) column matrix y( 0). The individual 
square matrices E p are given in the second column of Table 7-13*2, 
while the continued products of these matrices, taken up to the 
beginnings of the successive steps, are entered in the third column. 
Thus, the first matrix in the third column is the unit matrix; the 
second is the product of the first matrix in the second column and this 
unit matrix; the third matrix in the third column is the product of the 
second matrices in the second and third columns, and so on. The last 
matrix in the third column is the continued product of all the matrices 
E p from t = 1-8 to t = 8-4. 

The values of y x and y 2 at the beginning of the steps, corre¬ 
sponding to known values at t = 1-8, can now be written down. For 
instance, if y = {£, yj) at t = 1-8, the value of y at t = 6-8 is 

y = I"-0-53939, -0-397681 m. 

L 1-24864, — 0-93330J 

As a particular case suppose that y = {1,0} at t = 0. The series 
solution computed in §7-8 then gives y(l-8) = {0-2023, —1-0623}. 
These values in turn provide the terminal values for y x as given in the 
last column of Table 7-13-2. 
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The exact solution of (1), with the initial conditions {x, Dx} = {1,0} 
at t = 0, is expressible in terms of Bessel functions as 

1-230* = 

The true and approximate solutions are compared in Fig. 7-13-1. 



Table 7-13-1. Example No. 2 
Computation of Elements of Matrices E p 


Step 

u n (p) 

CU 

COB CUT 

sin cut 

1 . 

— sin cur 

cu 

-cu sin cut 

ia-i-9 

-1*85 

1*36015 

0*99077 

0*13559 

0*09969 

-0*18442 

1*9-20 

-1*95 

1*39642 

0*99027 

0*13919 

0*09968 

-0*19437 

2*0-2 1 

-2*05 

1*43178 

0-98976 

0-14269 

0*09966 

-0*20430 

21-2*2 

-2*15 

1*46629 

0-98927 

0*14610 

0-09964 

-0*21422 

2*2-2*4 

-2*3 

1*51657 

0*95436 

0-29868 

0*19694 

-0*45297 

2*4-2*6 

-2*5 

1*58114 

0-95042 

0*31099 

0*19669 

-0*49172 

2-6-2*8 

-2*7 

1*64317 

0*94649 

0*32275 

0*19642 

-0*53033 

2*0-30 

-2*9 

1*70294 

0-94255 

0*33404 

0*19615 

-0*56885 

30-3*4 

-3*2 

1*78885 

0-75473 

0*65603 

0*36673 

-1*17354 

3*4r-3*8 

-3*6 

1*89737 

0*72556 

0-68816 

0*36269 

-1*30569 

3*8-4*2 

-4*0 

2*0 

0*69671 

0*71736 

0*35868 

-1*43472 

4*2-4*8 

-4*5 

2*12132 

0-29361 

0-95592 

0*45062 

-2*02781 

4*8-5*4 

-5*1 

2*25832 

0-21413 

0-97680 

0-43253 

-2*20593 

5*4-60 

-5*7 

2*38747 

0-13787 

0*99045 

0*41485 

-2*36467 

6-0-6-8 

-6*4 

2*52982 

-0-43772 

0*89911 

0-35540 

-2*27459 

6*8-7*6 

-7*2 

2*68328 

-0*54453 

0*83874 

0-31258 

-2*25058 

7*6-8*4 

-8*0 

2*82843 

-0*63804 

0*77000 

0*27224 

-2*17789 
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Table 7-13-2. Example No. 2 
Computation of the Matrix Chain E p E p ... E x 


Step 

Matrix E p 

Product E v E p _ 1 ...E 1 

Column y(t v - x ) 

1-8-1-9 

0-99077 0-09969 

-0-18442 0-99077 

1-0 0 

0 1-0 

0-2023 

-1-0623 

1-9-20 

0-99027 0-09968 

-0 19437 0-99027 

0-99077 0-09969 

-0-18442 0-99077 

0-0945 

2-0-2-1 

0-98976 0-09966 

-0-20430 0-98976 

0-96275 0-19748 

-0-37520 0-96175 

-00150 

21-2-2 

0-98927 0-09964 

-0-21422 0-98927 

0-91550 0-29131 

-0-56805 0-91166 

-0-1243 

2-2-24 

0-95436 0-19694 

-0-45297 0-95436 

0-84908 0-37901 

-0-75807 0-83937 

-0 2309 

2-4^2-6 

0-95042 0-19669 

-0-49172 0-95042 

0-66103 0-52702 

-1-10808 0-62938 

-0-4261 

20-2-8 

0-94649 0-19642 

-0-53033 0-94649 

0-41031 0-62468 

-1-37818 0-33903 

-0-5806 

2-8-30 

0-94255 0-19615 

-0-56885 0-94255 

0-11765 0-65785 

-1-52203 -0-01040 

-0-6750 

30-3-4 

0-75473 0-36673 

-1-17354 0-75473 

-0-18766 0-61802 

-1-50151 -0-38402 

-0-6946 

3-4-38 

0-72556 0-36269 

-1-30569 0-72556 

-0-69228 0-32561 

-0 91301 -1-01510 

-0-4859 

3-8-4-2 

0-69671 0-35868 

-1-43472 0-69671 

-0-83343 -0-13192 

0-24146 -1-16166 

-0-0285 

4-2-48 

0-29361 0-45062 

-2-02781 0-29361 

-0-49405 - 0-50857 

1-36397 -0-62007 

0-4403 

4-8-54 

0-21413 0-43253 

-2-20593 0-21413 

0-46958 -0-42874 

1-40231 0-84922 

0-5504 

6-4-60 

0-13787 0-41485 

-2-36467 0-13787 

0-70709 0-27551 

-0-73558 1-12761 

-0-1496 

60-6-8 

-0-43772 0-35540 

-2-27459 -0-43772 

-0-20767 0-50577 

-1-77345 -0-49603 

-0-5793 

0-8-7-6 

-0-54453 0-31258 

-2-25058 -0-54453 

-0-53939 -0-39768 

1-24864 -0-93330 

0-3133 

7-6-84 

-0-63804 0-27224 

-2-17789 -0-63804 

0-68401 -0-07518 

0-53402 1-40322 

0-2182 

8-4 

— — 

-0-29104 0-42998 

-1-83042 - 0-73158 

-0-6156 
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7*14. Example No. 3. As a more complicated example we shall 
next choose the equation 

7Tt\ 

2/ 

and obtain the solution for the interval t = 0 to t = 4. 

The equivalent first-order system is 


Dx + ^647t 2 + 64 cos 2 ~ — 47r sin ~ 


)*= 0 , 


D%r+( 16 cos 




— 647T 2 ■ 


0, 

o 7Tt A . 7 Tt 

■64 cos 2 — 4-47rsm —, 
2 2 


— 16 cos 


7 Tt 

~2i 




where 


y = {x,Dx}. 


In the present case the method of mean coefficients will be used 
throughout. The matrix u here has two variable elements u 21 and u 22 , 
and the latent roots are found to be complex, say A = /i ± i(o. The 
formula (7-11*2) gives 


y%) = e^ T 


11 . 

— -Sinwr + COS (OT, 
(O 


^21 •- 
(O 


sm cor, 


or 


y(t P ) = 


11 • 

—-sin6>T + cos (or, 
<0 

(/i 2 + (o 2 ) . 

— —-sin (or, 

(O 


1 . 

- sm (or 

(O 

(C/ 22 w) . 

—— sm (OT -f COS (OT 
(O 


y{t P -i)> 


1 . 

-sm (or 

(O 


y(tp- 1 )> 


u> . 

— am(OT +COS (OT 
(O 


since U 21 = — (/i 2 + (o 2 ) and U 22 = 2/i. 

The computations, which are otherwise similar to those for Example 
No, 2, are summarised in Tables 7-14-1 and 7-14-2. The fourth column 
of Table 7-14-2, headed lie' 47 , represents the contributions to the matrix 
chain E p E p _ x ...E 1 arising from the scalar multiplier e^ T which appears 
in the solution. It is obviously convenient to effect these particular 
multiplications independently. 

Table 7-14-2 may be regarded as completing the formal numerical 
solution. When any initial conditions y(0) are assigned, the values of 
y at the ends of the steps are at once deducible as explained in 
connection with Example No. 2. For instance, at t = 2-8, 


y(2-8) = 127-0 


' 0-5890, 

0-03761] 2/(0) = 

‘ 74-79, 

4-776' 

.-20-54, 

0-3859 J 

.-2608, 

49-00. 


y(o)- 
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The calculations have been completed in detail for the particular case 
y(0) = {1, - 8}. It can be verified that the exact solution appropriate 
to these initial conditions is 

/ 16 . nt\ 

x=y l = exp I - — sin — 1 x cos 87rf, 

and the graph of this function is given in Fig. 7-14-1. It oscillates 
throughout the time range and the amplitude varies considerably. 



Table 7-14-1. Example No. 3 
Computation of Elements of Matrices E p 


Step 

U tl 

U„ 


CO 


p . 

- - 8111 COT 
CO 

+ cos cor 

1 . 

- sm cor 

CO 

U 21 . 

— srncur 
CO 

fi . 

c sin aw 

CO 

+ cos cm 

0 -0*2 

-691-6 

-15-74 

-7-87 

25-09 

0-2072 

0 0030 

-003799 

26-27 

0-6010 

0-2-0-4 

-676-6 

-14-20 

-7-10 

25-02 

0-2417 

0-0160 

-0-03827 

25-89 

0-5594 

04-0-6 

-664-8 

-11-27 

-5-63 

24-96 

0-3240 

0-0594 

-0-03850 

25-21 

0-4930 

0-6-0-8 

-634-0 

-7-23 

-3-62 

24-92 

0-4853 

0-1286 

-003866 

24-51 

0-4084 

0-8-1-0 

-621-4 

-2-49 

-1*25 

24-90 

0-7796 

0-2152 

-003875 

24-08 

0-3120 

10-1-2 

-621-4 

2-49 

1-25 

24-90 

1-2827 

0-3120 

-0-03875 

24-08 

0-2152 

1-2-1-4 

-634-0 

7-23 

3-62 

24-92 

2-0606 

0-4084 

-0-03866 

24-51 

0-1286 

1-41-6 

-664-8 

11-27 

5-63 

24-96 

3-0864 

0-4930 

-0-03850 

25-21 

0-0594 

1-6—1-8 

-676-5 

14-20 

7-10 

25-02 

4-1371 

0-5594 

-0 03827 

25-89 

0-0160 

1-8-20 

-691-6 

15-74 

7-87 

25-09 

4-8259 

0-6010 

-0-03799 

26-27 

00030 

20-2-2 

-696-6 

15-74 

7-87 

25-17 

4-8259 

0-6129 

-003769 

26-21 

0-0197 

2-2-2-4 

-687-8 

14-20 

7-10 

25-25 

4-1371 

0-5959 

-0-03738 

25-71 

0-0651 

2-4-2-fl 

-672-5 

11-27 

6-63 

25-31 

3-0864 

0-5525 

-0-03710 

24-95 

0-1347 

2-0-2-8 

-656-3 

7-23 

3-62 

25-36 

2-0606 

0-4857 

-003691 

24-22 

0-2185 

2-8-30 

-6461 

2-49 

1-25 

25-39 

1-2827 

0-4028 

-0 03680 

23-78 

0-3108 

30-3-2 

-646-1 

-2-49 

-1-25 

25-39 

0-7796 

0-3108 

-0-03680 

23-78 

0-4028 

3-2-3-4 

-656-3 

-7-23 

-3-62 

25*36 

0-4853 

0-2185 

-0-03691 

24-22 

0-4857 

3-4-3-S 

-672-6 

-11-27 

-5-63 

25-31 

0-3240 

0-1347 

-003710 

24-95 

0-5525 

3-0-3-8 

-687-8 

-14-20 

-7*10 

25-25 

0-2417 

00651 

-003738 

25*71 

0-5959 

3-8-40 

-695-5 

-16-74 

-7-87 

25-17 

0-2072 

0-0197 

-0-03769 

26-21 

0-6129 
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Table 7*14*2. Example No. 3 


Computation of the Matrix Chain E p E v _ 1 ... E x 


Step 

Individual Matrices 

Matrix Product 

Ile^r 


0 -0*2 

0 0030 -0*03799 
26*27 0*6010 

1*0 0 

0 1*0 

1*0 

1*0 

0*2-0*4 

0*0160 -0*03827 
25*89 0*5594 

0 0030 -0*03799 
26*27 0*6010 

0*2072 

0*06369 

0*±-0*6 

0*0594 - 0*03850 
25*21 0*4930 

-1*0053 -0*02361 
14*77 -0*6474 

0*05008 

-0*04089 

0*6-08 

0*1286 -0*03866 
24*51 0*4084 

-0*6284 0*02352 

-18*06 -0*9144 

0*01623 

-0*01326 

0*8-10 

0*2152 - 0*03875 
24*08 0*3120 

0*6174 0*03838 

-22*78 0*2030 

0*007874 

0*00244 

10-1*2 

0*3120 -0*03875 
24*08 0*2152 

1*0156 000039 

7*76 0*9875 

0006139 

0-00622 

1-2-1-4 

0*4084 -0*03866 
24*51 0*1286 

0*0162 -0*03814 
26*13 0*2219 

0*007874 

000253 

1*4-1 *6 

0*4930 -0*03850 
25*21 0*0594 

-10036 -0*02416 
3*76 -0*9063 

0*01623 

-0*01315 

1-6-1-8 

0*5594 -0*03827 
25*89 0*0160 

-0*6395 0*02298 

-25*08 -0*6629 

0*05008 

-0*04123 

1*8-20 

0*6010 -0*03799 
26*27 00030 

0*6021 0*03822 

-16*96 0*5843 

0*2072 

0*06139 

20-2*2 

0*6129 -0*03769 
26*21 0*0197 

10062 0 00077 

15*77 10058 

1*0 

1-000 

2*2-2*4 

0*5959 -0*03738 
25*71 0*0651 

0*0223 -0*03744 
26*68 00400 

4*826 

1 553 

2*4-26 

0*5525 -0*03710 
24*95 0 1347 

-0*9840 -r-0*02381 

2 31 -0-9600 

19*96 

-15*84 

2*6-2*8 

0*4857 -0*03691 
24*22 0*2185 

-0*6294 0*02246 

-24*24 -0*7234 

61*62 

-49*86 

2*8-30 

0*4028 -0*03680 
23*78 0*3108 

0*5890 0*03761 

-20*54 0*3859 

127*0 

36*58 

30-3*2 

0*3108 -0*03680 
23*78 0*4028 


162*9 

160*5 

3*2-3*4 

0*2185 -0*03691 
24*22 0*4857 


127*0 

41*19 

3*4-3*6 

0*1347 -0*03710 
24*95 0*5525 


61*62 

-48*49 

3*8-8*8 

0*0651 -0*03738 
25*71 0*5959 

-0-6383 0-02227 

-16-88 -0-9788 

19*96 

-16*30 

3*8-40 

0*0197 - 0*03769 
26*21 0*6129 

0*5894 0*03804 

-26*47 -0*0107 

4*826 

1*376 

— 

— — 

1*0093 0*00115 

-0*78 0*9905 

1*0 

1000 
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Between t = 0 and t = 2, the ordinate is small: on the other hand, it 
grows to large values of the order ± 150 between t = 2 and t = 4. 
Consequently, in order to obtain a reasonable graphical representation, 
the scale has been magnified 200 times for the first half of the range. 
The approximate results computed by postmultiplication of the matrix 
chain by {1, —8} are marked in the diagram, and they accord well 
with the curve. 


7-15. Example No. 4. We shall next solve by the method of 
mean coefficients the problem treated by other methods in example 
(iii) of §7*10. 

The first-order system equivalent to (7* 10-1) is 

* - [_x -ur** 

where y x = R and y 2 = dRjdr. To obtain a preliminary estimate of 
the characteristic numbers K a constant mean value for 1/r, namely §, 
will be assigned throughout the interval r = 1 to r = 2. The solution 
of the equation is then (see (7* 11-2)) 


y( 2) = e/* 


— — sin (i) + cos a*, 
0) 


K . 

-sin (O, 

a) 


1 . 

-sinw 

0) 


y( i)> 


i 

0) 


(§ +/*)sina; + cosG> 


( 1 ) 


where /i ± io) are the latent roots of u . The boundary conditions are 
y x {2) = y x (l) = 0, and the first of the two equations implicit in (1) 

therefore yields ^ 

0 = —sinw.y 2 (l), 

whence sinw = 0 or o) = nn. But the latent roots of u , with 1/r = f, 
are — £ + — Hence oj = nn or K — 7i 2 7T 2 -f^, and 

rough approximations for K are therefore 7r 2 , 47r 2 , 97 t 2 , etc. 

Proceeding next to a more refined approximation, we assume the 
membrane to be divided into five sections by circles of radii 1*2,1-4,1*6 
and 1*8 and use an average value for the radius of each section. If also 
trial values for K are adopted, namely 9*8 and 9-9, 39*4 and 39*5, 88-8 
and 88*9, then for each value of K the elements of the successive 
matrices corresponding to the five sections are all numerically deter¬ 
minate. The value of y % at r = 1 is arbitrary: hence we may adopt as 
initial conditions y( 1) = {0,1} and deduce the end conditions y( 2). 
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Table 7* 15-1. Example No. 4 
Computation of y x ( 2) for various values of K 


0-8211 0*1713 
-1-6784 0-6670 


{»1. Vi) 

Matrix 

{»i> Vt) 

0 

0-8193 0-1712 

0 

1 

-1-6947 0-6662 

1 


1-2-1 *4 

0-8195 0-1737 
-1-7021 0-6876 

1-4-1-6 

0-8184 0-1753 
-1-7183 0-7027 


10-1-8 

0-8174 0-1767 
-1-7321 0-7149 

ia-2-o 

0-8166 0-1778 
-1-7425 0-7241 



0-8177 0-1736 
1-7186 0-6858 


0-1712 

0-6652 

0-8166 0-1752 
1-7349 0-7009 


0-2555 

0-1620 


0-2370 

-0-3297 




K = 39-4 

K = 39-5 
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Linear interpolation between the values of K can then be used to 
give the value of K for which y x ( 2) = 0. 

The results are summarised in Table 7 -15-1. The actual computations 
of the elements of the matrices E p are not tabulated; they are similar 
to those of Example No. 3 for each value of K adopted, except that 
in this case the scalar factor e 0 ' 2 ^ is absorbed into the matrices for 
convenience. Since the initial conditions y(\) are prescribed, the 
columns y are computed in succession until y x ( 2) is deduced. This 
differs slightly from zero for each approximate value of K chosen. 
Interpolation to make y x ( 2) = 0 gives the values K = 9*750, 39*36, 
88*70, which agree very closely with the values 9*753, 39*35, 88*72 
found from the tables of Bessel functions. 

The mode of displacement, which is given by y^l), j/ 1 (l*2), ^(1*4), 
^(1*6), ^(1*8) and y x ( 2), can be deduced by interpolation, or by re¬ 
calculation using the more exact values of K. 



CHAPTER VIII 


KINEMATICS AND DYNAMICS OF SYSTEMS 

Part I. Frames of Reference and Kinematics 

8*1. Frames of Reference. Before taking up the subject of 
dynamics, we shall first deal with changes of reference axes, and with 
the analysis of motion quite apart from the question of the causation 
of the motion. 

The positions which the points of any geometrical system, or the 
particles of any material system, occupy at any instant, relative to a 



Fig. 811 

datum body (e.g. the earth) can be specified by the Cartesian co¬ 
ordinates of these points or particles referred to a system of rectangular 
axes Ox v Ox 2 , Ox 3 —or frame of reference 0(x v x 2 ,x 3 )—fixed in 
that body. In usual practical applications the specification of position 
with reference to such a fixed frame of reference is treated as absolute; 
and it is convenient to regard the time-rates of change of the co¬ 
ordinates as defining “absolute velocity ” and “absolute acceleration”. 

It is frequently helpful to adopt in conjunction with the fixed axes 
one or more auxiliary frames of reference, such as 0(x v x 2 , x 3 ) in Fig. 
8*1*1, the absolute positions of which are changing in some specified 
manner with time. For the sake of definiteness we shall suppose all 
sets of axes used to be right-handed, as represented in the diagram. 
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The conventions regarding the signs of rotations are as follows. An 
axis of rotation is regarded as possessing a definite sense, and the 
convention is adopted that the positive sense of rotation about the 
axis would appear clockwise to an observer looking along the axis in 
the positive sense. For example, the positive sense of 0x x is from 0 to 
x v and a positive rotation about 0x x would move x 2 towards x 3 . 

The Cartesian coordinates of a general point P, referred to the fixed 
axes, will hereafter be denoted as x v x 2i x 3 \ while those of the same 
point, relative to the auxiliary axes, will be x v x 2 , x 3 . At the outset we 
shall consider the simple case of two-dimensional motion. 


8*2. Change of Reference Axes in Two Dimensions. Here 
the position of the auxiliary frame of reference at any instant t can be 
specified completely by the Cartesian coordinates (a v a 2 ) of 0 referred 
to the fixed axes, and by the inclination <j> of Ox x to Ox v 



If 5 s {5i, ? 2 } an( i £ = {£i> £ 2 } denote, respectively, the columns of the 
components of any vector V in the plane, measured parallel to the 
fixed and the auxiliary axes, then 

£ = .( 1 ) 

in which I = Y cos <f> sin <jf\. 

L — sin <f> cos <f >J 

The matrix I is orthogonal (see §M7) since 


Also 


jdl -1 _ _ r [" cos0 sin T 

dt ~ dt ^ [ — sin <j> cos 0J [ 


I” 1 = |“cos <j> —sin 01 = r. 

[sin (j> cos <f>\ 

sin 0] [ — sin (f> — cos <f> 
cos0 —sin^ 


3-ra- 


•(2) 
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8*2 


where p=<j denotes the angular velocity of the moving frame. For 
brevity we shall write 

On differentiation with respect to t equation (1) yields 



whence by (2) and (1) 


dt dt d& 


•f-f+«-<»+->«■ 


.<* 5 , 


Now by (1) l-~ represents the components of the time-rate of change 
dt 


of the vector V, measured in the directions of the moving axes at time t. 
In other words the components of the time-rate of change of any 
vector measured in the directions of the moving axes are obtained 
from the components £ of the vector itself in the same directions by the 
operation ID + w performed on £. For instance, if V represents the 
absolute position of a general point P of the plane, then x = l(x — a), 
or te = to-f x. Hence the components of the velocity of P measured 
parallel to the axes Ox 1 and Ox 2 are given by 


v = (ID + w)(la+x). .(3) 

In particular, if the components of the velocity of 0 in the same direc¬ 
tions are denoted by u , then u = (ID + w) la, so that (3) may be written 

v = u+(ID + m)x. 

Similarly, if the components of the acceleration of P measured parallel 
to the moving axes are a, then 

a = (ID+w) v. 

These formulae can, of course, also be established by quite elementary 
methods, but the treatment adopted has certain attractions when 
generalized to three dimensions. 


Examples 


(i) Matrices Representing Finite Rotations . If <j> is increased to 
<f> + 0 ), then I becomes 


i^r cos (<j>+o)) 
[—sin ($ + (o) 


sin (<f> = T cos <o sin ali. 
cos (<f> + (o)\ [ —sin cosa>J 
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AXES OF A CONIC 


Hence premultiplication of I by the matrix 

p((j)) = r cos (t) sin o )1 
[ — sin a) coswj 

is equivalent to a rotation of the frame 0(x v x 2 ) through an angle o) 
about the normal axis through 0. More generally, any succession of 
such rotations o) v a) 2 , . .., (o n will be represented by 

h = P(o>n) />K-l) + + ...+ 0) 2 + (0 1 )l. 

If o) n = CL> n _ 1 =...==(«>! = 2s7r/n, where 8 is any integer, then after the 
complete cycle of rotations the frame is returned to its original position 
and [/>( 2 s 7 r/n)] n = J 2 . Hence the matrices p{ 287r/n) are all nth roots of I 2 . 


(ii) Determination of the Axes of a Central Conic . The equation of 
a conic referred to axes 0(x v x 2 ) through the centre may be written 
x f ax = 1, where a is a symmetrical square matrix. When referred to 
any other frame of reference 0(x v x 2 ) through O the equation becomes 
x'bx = 1, where b denotes the symmetrical matrix lal~ l or lal' (see 
also § 1*15). If I is chosen so that 6 is a diagonal matrix, say 

6 = [l/a? 0], 

L 0 l/all 

then and a 2 will be the semi-axes. In this case 

a = l^p/af 0 11, 

L 0 l/a|J 

so that 1/a? == A* and 1/a? = A 2 are the latent roots of a (see §3*6). 
Moreover l u and I 12 will be proportional to the elements in any column 
(or row) of the adjoint of A x I — a. The directions of the axes are then 
given by cot <j> = I u /I 12 . 

For instance, if the given conic is 


Zx\ 4- 2x x x 2 + 3x? = 8, 


then 


a — 


[II]- 


and the latent roots of this matrix are found to be A x = J and A 2 = 
Hence the semi-axes are a x = ^/2 and a 2 = 2. Again, the adjoint 
XI—a is if8A-3 1 

1 


i[8A-3 1 ], 

L 1 8A-3J 


l 

of 


and the rows are proportional to [1,1] for A x = | and to [—1,1] for 
A a = Hence, if the axis Ox 1 is taken to correspond to A x = we 
have cot 0 = 1. The inclination of the minor axis is thus 45° to Ox v 
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8-3 


8*3. Angular Coordinates of a Three-Dimensional Moving 
Frame of Reference. To specify the (absolute) orientation of a 
frame of reference in three dimensions three independent parameters 
are required. These may be chosen in various ways. The method 
which is usual in investigations concerning the motion of aeroplanes* 
is a modification of the system of “angular coordinates” originally 
introduced by Euler. 

As we are at present concerned only with the question of orientation, 
we may temporarily suppose O and 0 in Fig. 8-1*1 to be coincident. 



Fig. 8-3-1 


With O as centre, draw a sphere of unit radius to intersect the fixed 
and the moving sets of axes in X v X 2 , X 3 and X v X 2 , X 3 (see Fig. 8*3*1). 
In the original system of Eulerian coordinates the frame 0(X V X 2 , X 3 ) 
is viewed as displaced to the position 0(X V X 2 , X 3 ) by three successive 
rotations which are not actually represented in Fig. 8*3*1. Firstly, a 
rotation about OX 3 brings X 2 to some intermediate position X 2 ; a 
rotation about OX 2 next brings X 3 to X 3 ; lastly a rotation about OX 3 
brings the frame to its final position. In the modified system (specified 
in Fig. 8*3*1) the successive rotations are <f > 3 , <f> 2 , <f> v respectively, about 
the successive carried positions of the axes OX 3 , OX 2 , OX v For brevity 


* See, for instance, p. 261 of Ref. 27. 
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a succession of rotations such as that represented in Fig. 8 * 3*1 will be 
called a sequence of rotations (f> 3 , <f> 2 , <}>i about the carried axes taken 
in the order OX 3 , OX 2 , OX v 

8*4. The Orthogonal Matrix of Transformation. The com¬ 
ponents § 2 , £3 and £ 2 , g 3 of any vector measured respectively 
parallel to the fixed axes and to the moving axes in Fig. 8 * 1*1 will be 
connected by the linear substitution 




i 

= *?, 


in which 

1 = 

hi 

*12 

*13' 



*21 

*22 

*23 



_*31 

*32 

*33_ 


and l 8l , l 82 , l 83 denote the direction cosines of the typical moving axis 
0X 3 referred to the fixed axes. More generally, if M is a (3,n) matrix 
representing the components of any n vectors parallel to the fixed 
axes, and if M is the corresponding matrix of vector components 
parallel to the moving axes, then M = IM. In particular, suppose M 
to be the unit matrix I 3 corresponding to the three unit vectors parallel 
to the moving axes. Then it is easy to see from Fig. 8 * 1*1 that M = V , 
so that / 3 = IV. The matrix of transformation is accordingly ortho¬ 
gonal. Particular properties to be noted are that 1I j = 1 , and that, 
since l' is also the adjoint of Z, every element of l equals its own cofactor. 


8*5. Matrices Representing Finite Rotations of a Frame of 
Reference. Suppose the frame 0(x v x 2) x 3 ) in Fig. 8*1*1 to be rotated 
about the axis 0x 1 through an angles to a new position 0(x v x$,x£), 


and let l become 


*1 = 


*n *12 1 


11 

lli 


12 

*22 

*3*2 


*13 

*£ 

I* 


Then since 0x$ is perpendicular to 0x v and inclined at angles (0 and 
\i t — g) to 0x 2 and 0x 3 , respectively, we have by the properties of 
direction cosines 


and similarly 
It follows that 


Wx,n. *23} = {0> cos to, s in to}, 
l{l*v l *2> *33} = -sinto, cosw}. 


ll[ = I 

*11 

*21 

_1 

= 

"1 

0 

0 " 


*12 

*^ 

*32 


0 

COS ( 1 ) 

— sin ( 1 ) 


*13 

*23 



0 

sin o) 

COS (i)_ 
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whence by transposition and use of the orthogonal property, 

= ri o oil. 

0 coso> sin a; 

0 — sin coboj_ 

Writing this relation as l x = p x (o)) I, .(1) 

we see that the premultiplication of I by p x (o)) corresponds to the 
rotation of the frame 0(x v x 2 , x 3 ) through an angle (o about the axis 
Ox v Similarly, the premultipliers 

p 2 (co) = I"cos co 0 —sin oj 
0 10 
sin to 0 cos co 

cos (o sin o) 

— sin w cosw 
0 0 

correspond to rotations cj of the frame about Ox 2 and Ox 3 , respectively. 
The matrices p v p 2 , p 3 have the characteristic orthogonal property 
p- 1 = p'\ also it can be verified that 

dp[_ ro o 01. dp^_ r o o i 

Pl d(0 “ 0 0 - 1 * P *d(t) ~~ 0 0 0 

_0 1 OJ L-1 0 0 




dpji _J”° — 1 0’ 

p3 d(i)~~ i o 0 

.0 0 0 


( 2 ) 


If the frame is given a sequence of three rotations o) 3 , (o 2 , (o v the first 
being about Ox 3 and the others being about the successive carried 
positions of Ox 2 and Ox v then 

h = Pi{o>i)pMPz(<»s) l .(3) 

where l x now is the matrix of transformation for the final position. 
The order of these rotations is, of course, not commutative in general. 
However, in the special case where the angles of rotation are infinitesi¬ 
mally small, say Sa ) 3 , So) 2 , Sa) v then the last equation gives 


I + ST = 1 0 

— So) 3 1 0 

0 0 1J 


1 0 
0 1 


0 — S(x) x 



1 0 — S(i) 2 

0 1 0 

S(i)n 0 1 


l 






ANGULAR VELOCITIES 


= r 0 


&l)g ~ 


I — $(0$ 0 S(0^ I 

L 8(l)^ — S(l)i 0 J 

The order in which such infinitesimal rotations are performed is thus 
immaterial. If the total time of the operation is $t> then in the limit 
(see also (8*2*2)) 


dl 

dt “ dt l ” 


0 "A 1 =w, say, 
Pa 0 -Pi 

-Pa Pi 0 . 


in which p v p 3 , p 3 are the instantaneous angular velocities of the 
moving frame about the axes Ox v Ox v 0x 3 , respectively. 


Examples 

(i) A Property of the Matrix m. It can at once be verified that, if 
y v Vi, y 3 are arbitrary, then 

0 -p 3 p 2 "I Tyil + T 0 -y 3 yA VpA = 0. 

Pa 0 -Pi Vi Va 0 ~Vi Pi 

.-Pi Pi 0 JUsJ Y-Vi Vi oJLps. 


This identity will be useful later. 

(ii) Postmultiplications Representing Rotations. Equation (1) can be 
written alternatively as = Icrfoj), where 

.( 6 ) 

Direct multiplication of the product Vp x (<t>) l yields the somewhat 
cumbrous expression 

CT-^u) = COS (i) I 3 + (1 — COS Oj) Hu, li2, lis} [Iji, ll2 ,IjjJ 

4" sin 0 ) 0 Ijj Ijg • 

—1 13 0 l ll 

. ^12 — ^11 0 . 

A more elegant form, whose verification may be left to the reader, is 

<r 1 (o>) = (B')- 1 B, .(7) 

£s[ 1 f 13 tan %a> —1 12 tan \(o\. 

— Z 13 tan 1 l u tan Jw 

f 12 tan — l n tan J oj 1 


where 







254 


FINITE ROTATIONS 


8-5 


(iii) Rotation of Frame 0(x v x 2 , x 3 ) about a General Axis through 0. 
Suppose the given general axis to be OX l9 and let OX 2i OX 3 be any 
two further axes which together with OX x form an orthogonal frame 
whose matrix is [L ti ]. Now imagine the frame 0(x l9 x 2 , x 3 ) to be rigidly 
attached to the frame 0(X v X 2 ,X 3 ), and take points on the axes 
Ox l9 Ox 2 , Ox 3 at unit distance from 0. Then denoting the columns of 
the coordinates of these points relative to the frame 0(X l9 X 2 , X 8 ) as 
£, V> £> we h ft ve by an obvious application of (8-4-1) 

K,^n = jLP. 

But the sets of coordinates £, rj, £ are unchanged when the frame 
0(X l9 X 2 , X 3 ) is given a rotation (o about OX v Hence also 

where l x is the matrix appropriate to the new position of the frame 
0(x v x 2 , x 3 ). It follows that LV = p^o^Ll^ which gives 

l 1 = lL , p 1 (a))L. . ( 8 ) 

Hence on application of (6) and (7) 

h == B, .(9) 

where B == V 1 L 13 tan — L 12 tan 1.(10) 

— L 13 tan 1 jL n tan \<o 

Z/ 12 tanked — i n tan|w 1 

Note that if the frame 0(X 1 ,X 2i X 3 ) coincides with 0(x l9 x 2 , x 3 ), so 
that L = I, then the formula (8) immediately reduces to (1). 

(iv) Rodrigues' Formula. This formula states that if a rigid body is 
turned through an angle u) about an axis through the origin O whose 
direction cosines are (L 1V jL 12 ,L 13 ), and if (x lt x 2 ,x 3 ) and (xf,x$ 9 x* ) 
are the coordinates, referred to fixed axes, of any point P of the body 
before and after the rotation, then 

B'x = Bx* 9 

where B is as defined by (10). This formula can be deduced as follows. 
Suppose (x l9 x 29 x 3 ) to be the coordinates of P referred to any orthogonal 
frame of reference fixed in the body and passing through O. Then if I 
is the matrix of this frame, we have by (8-4-1) and (9) 

x = lx = l x x* = I (B^-'Bx*. 

Rodrigues’ formula immediately follows. 






8-6 


ANGULAR VELOCITIES 


255 


8*6. Matrix of Transformation and Instantaneous Angular 
Velocities Expressed in Angular Coordinates. We shall next 
apply (8*5*3) to the particular sequence of rotations represented by 
Fig. 8*3*1. If? now denotes the matrix for the final position, then clearly 


^ ~ Pl(0l) Pz{$2>) 

and l~ l = Paifiz) p 2 ( < f > 2 ) Piifii)* 

The product (1) yields 

l = C 2 C 3> ^2 5 3> ^2 > 

• ~” c 1^3i" 5 l 5 2 C 3> ^1 C 3 4* ^1^2^3> 

^34*CiS 2 c 3 , ^1^3 4-<+83 ^3> c 1 c 2 _ 


(1) 

( 2 ) 

.(3) 


where c if s i are abbreviations for cos <f> iy sin <f>^ This formula gives the 
direction cosines of the moving axes explicitly in terms of the angular 
coordinates of the frame. 

Corresponding formulae for the instantaneous angular velocities 
p v Pz> P 2 (see § 8*5) can be obtained as follows. Using (8*5*4) in con¬ 
junction with (1) and (2), we have 

m - 'ir " 


which on application of (8-5*2) gives 


w — 03P1P2 

“0-1 0" 

P 2 P 1 4~ 02^1 

1 

i-H 

O 

O 

1_ 

Pi + 01 

0 

0 

0 


1 0 0 


0 0 0 


0 0-1 


° 0 0 


-1 0 0 


.0 1 0 


After some reduction this yields 


m = 


0 , -c 2 Ci<f> 3 + 8 1 (f> 2y 

^2^103 ^102» 

-‘C 2 S 1 ( f) 3 — C 1 ( />2j “" 5 2^3 4'^1> 


Hence on comparison of (4) and (8*5*4) 


^2^103 4" ^102 * 

S 2 < Pz 01 
0 


Pi ~ 0i ““03 sin 0 2 , 

p 2 = 02 cos 0i 4" 03 s ^ n 0i cos 02> 

p z = — 0 2 sin (f> 1 + 03 cos <f> x cos 0 2 .„ 


These relations may be written conveniently 

P = i?0, 


(4) 

(5) 

(6) 
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where fl 0 — sm0 2 1. 

0 cos 0 X sin 0 X cos 0 2 
0 — sin^! cos <f> x cos 0 2 _ 

The reciprocal of R is 

jR-i = sec 02f cos 02 sin 0 X sin 0 2 cos 0 X sin 0 2 "1. 

0 cos 0! cos 02 — sin 0 X cos 0 2 
0 8^0! COS 0 X 

Similar methods may be used with any other system of angular 
coordinates. 

8*7. Components of Velocity and Acceleration. As in § 8-4 

let 5 and £ represent, respectively, the components of any vector 
measured parallel to the fixed axes Ox v Ox 2 , Ox 3 and to the moving 
axes 0x lf 0x 2 , 0x 3 . Then by differentiation of (8-4*1) with respect to t 
and application of (8-5-4) it is seen exactly as in § 8-2 that the com¬ 
ponents of the time-rate of change of the vector, measured in the 
directions of the moving axes at time t, are given by the operation 
ID + w performed on £. 

In particular, let the column x denote the coordinates of a general 
point or particle P at any instant relative to the moving axes; let u , 
v denote respectively the components of the absolute velocity of O 
and of P; and let a denote the components of the acceleration of P, 
all measured in the directions of the moving axes. Then as in § 8-2 


v = u + (ID + w)x .(1) 

and a =* (ID + w) v = (ID + vj)u + (ID + vj) 2 x .(2) 


If P is a point of a rigid body and the axes 0(x v x 2 , x 3 ) are fixed in 
the body (i.e. are body axes), then the coordinates x do not change as 


the body moves, and equations (1) and (2) reduce to 

u = u + mx, .(3) 

a = {ID 4- w) u -f- {Dvj + xu 2 ) x, .(4) 

Equation (3) can be written alternatively as (see (8-5-5)) 

v = u-xp, . (5) 

where x=\ 0 x 2 1- 

I #3 0 —#i| 


[. — #2 *1 ® J 

Expressions for the matrices I and w in terms of the angular co¬ 
ordinates 0 of the moving axes are given in § 8-6. 
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Examples 


(i) Kinetic Energy of a Rigid Body. The kinetic energy T is given by 
2T = +v\ +v\) = 

where m is the mass of a typical particle and £ 0 denotes summation 
for all the particles. If the moving axes are fixed in the body, then by (5) 

2T = S 0 m(u' 

= Z 0 m(u , u-2u'xp-p , X 2 P)> 

since yf = — % and u'xp = p'x'u • further, 0 is the centre of mass, 
so that = 0, then 

2T — Mu'u+p'Jp, 


where M is the total mass and J denotes the symmetrical square matrix 


J= -2 0 mx 2 == 


2 0 m(:rf-f£§) — E 0 mx x x 2 — 'E 0 mx l x z 

- l, 0 mx 2 x 1 Z 0 m(£§ + a;f) — 2, 0 mx 2 x 3 

L -Z 0 m^ 3 a ; 2 S 0 m(^ + xi)J 


.( 6 ) 


(ii) Angular Momenta. If h 2i h z are the angular momenta about 
body axes 0(x v x 2 , # 3 ) through the centre of mass, then it is readily 
shown that h = Jp 

where J is given by (6). The components of the time-rate of the angular 
momentum are accordingly 

$ = (ID + m) Jp. .(7) 


(iii) The Principal Axes and Moments of Inertia. Suppose the 
moments and products of inertia of a rigid body appropriate to the 
axes Ox v Ox 2 , Ox z (see Fig. 8* 1-1) to be known. It is required to find 
the corresponding constants for another set of axes Ox v Ox 2 , Ox z , 
derived by the matrix of transformation I. In particular it is required 
to find the principal axes, for which all the products of inertia vanish. 


Write 




2 0 ma?f 

Sq rtxx 2p^x 
l'£ 0 tnx 3 x l 


'L 0 mx 1 x z 
2 0 mic| 
S 0 TtlXgX 2 


XqTH«2^2*^'3 
S 0 maj§ _ 


and let 0 be the corresponding matrix with the coordinates x sub¬ 
stituted for x. Then 


0 = 2 0 mxx' = 'Z 0 mlxx'l~ 1 = IGI -1 . 
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If the new axes are principal axes, O reduces to the diagonal matrix 


0 0 

0 Zotttti 0 

0 0 I^nixf 

s ® 0 , 

0 ^(A z ^-A^^ A 2 ) ® 

. 0 0 J(4i+4a—A a )_ 

where A v A 2i A z are the moments of inertia about the principal axes. 
Hence the diagonal elements in the last matrix are the latent roots 
Ajj Aj, Ag of C?i so that A^ = Ag 4" Ag, Ag = Ag - !” Aj and A z = Aj -f" Ag. 
Moreover, the first row of elements in I (which yield the direction 
cosines of the principal axis corresponding to AJ will be proportional 
to any row of the adjoint of A X I — G. The other two rows of I are 
given similarly. 

A similar method can be used to find the axes of a central quadric 
(see also example (ii) of § 8*2). As a numerical illustration suppose the 
given quadric to be x'ux = 1, where 


u 


" 1 0*5 0*5' . 

0*5 1 0*5 

0*5 0*5 1 


The latent roots of u are Ax = 0*5, A a = 0*5 and A g = 2, so that the 
equation represents an ellipsoid the semi-axes of which are <J2, <J2 and 
1/^2. The direction cosines of the short principal axis are (1/^/3, 
1/V3, 1/^/3), which are taken proportional to the first row of the 
adjoint of A 2 I — u. For the equal roots A x and A a , the adjoint A 1—u 
is null, and the directions of the corresponding equal axes are obviously 
indeterminate. 


(iv) Homogeneous Strain . Let x represent the coordinates of a point 

•*“* write X = ax. .(8) 

when a is a square matrix of given constants. Then the field of points 
X is said to be derived from the field x by a homogeneous strain. An 
equivalent statement is that X is a linear vector function of x. 

Take three points at unit distance firom the origin, and let the three 
lines joining the points to the origin be at right angles. Then the three 
columns formed by the coordinates of the points ban be combined into 
a single orthogonal square matrix S(x). A problem which sometimes 
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arises is to find 8(x) so that the square matrix S(X) of the trans¬ 
formed points shall have the general orthogonal property 

S'(X)S(X) = d, . (9) 


where d is a diagonal matrix. In other words, it is required to find a set 
of three mutually perpendicular directions which remain mutually 
perpendicular after the strain. 

By equation (8) it follows that 8(X) = aS(x). Hence the condition 
(9) becomes S'(x)a'aS(x) = S~ l {x)bS{x) = d , 

where b is the symmetrical matrix a'a. Accordingly S(x) can be 
identified with the modal matrix k of 6, and d is the diagonal matrix 
of the latent roots of 6. 

In the special case where a is symmetrical, 6 = a 2 , and then b and a 
have the same modal matrix. Hence X = Ax , if x is identified with a 
column of k , and A is the corresponding latent root. Thus the lines 
joining the points x and X to the origin have the same directions, so 
that the set of three mutually perpendicular lines have the same direc¬ 
tions before and after the strain. In this case the strain is said to be 
pure. In general (8) represents a pure strain combined with a rotation. 

8*8. Kinematic Constraint of a Rigid Body. The position of 
a rigid body at any instant is defined uniquely by the position of any 
convenient rectangular frame of reference 0(x v x 2 , x 3 ) fixed in that 
body. Now the position of the body axes can be specified by six para¬ 
meters—for instance, the Cartesian coordinates a of 0, referred to fixed 
axes, and three angular coordinates (f >. Hence, unless definite relations 
are assigned between the six parameters, the rigid body has six degrees 
of freedom. These degrees of freedom may be taken to correspond to 
the positional coordinates just mentioned, or to any other equivalent 
set. If, on the other hand, relations are assigned between the posi¬ 
tional coordinates, the body will be subject to geometric or kinematic 
constraint , and it will then have less than six degrees of freedom. 

A very simple example of constraint would be fixture of one point 
of the body. Again, a point might be restricted to lie on a fixed surface, 
or to lie on a surface or curved guide which is itself forced to move in 
a prescribed manner. These constraints can all be represented by 
functional relations connecting the positional coordinates and possibly 
also the time variable t. 

More general types of constraint can occur in which sliding or rolling 
contact is imposed between the body and fixed or movable guides. 
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The restricting conditions may then consist of relations involving the 
time-rates of change of the positional coordinates. When these relations 
can be integrated without further knowledge of the motion, so that 
they may be replaced by equivalent conditions explicitly connecting 
the positional coordinates and possibly the time variable, the con¬ 
straint is said to be holonorn&us* When the kinematic conditions are 
not explicitly integrable the constraint is non-holonomous . 

Example 

Non-Holonomous Constraint . Suppose a sphere of radius R and 
centre 0 to roll (without sliding) on the fixed plane x 3 =» R. Choose 
auxiliary rectangular axes 0(x v x 2 , x 3 ) carried by the sphere and 
initially coincident with the fixed frame of reference 0(x v x 2 ,x 3 ). 
At time t let O be at (a v a 2 , 0) and let the angular coordinates of 
the sphere be 0 2 ,0 3 ( see Fig. 8-3-1). Then if x and x denote the 
columns of the coordinates of a general point P of the sphere at time 
t , referred to the fixed axes and the body axes respectively, we have by 
(8*4*1) x = l(x — a), where I is given by (8*6*3). Moreover, since P 
moves with the sphere we have by differentiation 0 = l(x — a) + i(x —a). 
Now if the point considered is in contact with the plane at time t , 
then x = {a v o 2 , i£}. Moreover, since there is no slip at the point of 
contact, the total velocity components of P must vanish, so that 
dk = 0. Hence the equation of constraint is 

l{d 1 , fl 2 > fi} = 

If (8*6*3) is used to express I and l in terms of the angular coordinates, 
the last equation yields the two independent conditions 
d x cos <j> 3 + d 2 sin <f> 3 = — R(f> 2 , 
d x sm —* o 2 cos ^2 —- R<j> x cos ^ 2 * 

These are non-integrable, so that the constraint imposed is nbn- 
holonomous. The sphere has three degrees of freedom so long as it 
remains in contact with the plane and no slipping occurs. If there were 
nothing to prevent the sphere from leaving the plane on one side, the 
constraint would be described as “one-sided”. 

8-9. Systems of Rigid Bodies and Generalised Coordinates. 

The conception of positional coordinates can be generalised to apply 
to any system of rigid bodies or particles. Such a system may include 

* Also holonomic. The term was introduced by Hertz to denote a constraint expressed 
by integral—as distinct from differential—relations. 
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certain bodies which, though possibly kinematically constrained, 
nevertheless have no direct geometric linkage with other members of 
the system: more usually the various bodies will be connected together 
in some way. It is evident that any possible configuration of the whole 
system can be specified completely by means of a suitable set of 
parameters or generalised coordinates , whose number will depend on 
the number of the bodies and of the kinematic constraints. 

The equations which represent the constraints mathematically will 
in general consist of a set of relations between the generalised co¬ 
ordinates, and the time-rates of change of these coordinates. We shall, 
however, be mainly concerned with cases in which these equations are 
directly integrable, so that they yield an equivalent set of relations 
explicitly connecting only the generalised coordinates and possibly 
also the time variable t. Restricting attention to such holonomous 
systems, we see that the configuration at any instant can be com¬ 
pletely specified by a set of s generalised coordinates, in conjunction 
with, say, r independent relations connecting these coordinates and 
possibly also involving t. The total number of effectively independent 
generalised coordinates will thus bem = 5-r, and will equal the total 
number of degrees of freedom of the system. 

In the case of the single rigid body considered in § 8-7, a typical point 
P of the body was identified by means of its Cartesian coordinates x y 
referred to a set of axes carried by the body. The Cartesian coordinates 
x of P , referred to the fixed frame of reference 0(x ly x 2 , x z ), were then 
expressible in terms of x and the positional coordinates a and <f> of 
the body. Similarly, with a system of rigid bodies, it will be possible 
to identify a particular point or particle P of the system by a set of 
parameters (corresponding to x) and to express the Cartesian co¬ 
ordinates x of P, referred to fixed axes, in terms of those parameters 
and the generalised coordinates of the system. Suppose the set of 
parameters relevant to P to be denoted for convenience by the single 
symbol a, and as usual let q v q 2 ,..., q m be the m effectively independent 
generalised coordinates of the system. Then the functional dependence 
of x on a, q (and possibly also t) will be expressible as 

x i — /l(?l> ?2> • • • > a )> 

X 2 ~ f 2(*?1> ?2> • • • > ?m> a )> 

== V Q2> • • • > a )> 

or more briefly as x =f(q v q 2 , ... y q my t y a) y .(1) 
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where x, f denote column matrices. These equations will be referred 
to as the geometrical equations of the system. 

Since the parameters a are assumed to be invariable with time, 
equation (1) yields on total differentiation with respect to t 


x = 


d l + ya*L 


( 2 ) 


The quantities are termed the generalised components of velocity or 
simply the generalised velocities . Similarly, the quantities q t are spoken 
of as the generalised accelerations . 


Part II. Statics and Dynamics of Systems 

8*10. Virtual Work and the Conditions of Equilibrium. 

The conditions of equilibrium of a system are most conveniently 
obtained by an application of the principle of virtual work. 

If a system of forces which act on a single particle is in equilibrium, 
then no work will be done in any displacement of the particle, provided 
that the forces remain constant in magnitude and direction during 
the displacement. Conversely, if the work done, as calculated on the 
foregoing assumption, is zero in any possible displacement of the 
particle, then the forces acting on the particle must be in equilibrium. 
It is true that in most actual cases the forces do vary as their point of 
application is moved, but for the purpose exclusively of a test for 
equilibrium they must be supposed to remain constant. Any actual 
changes in the forces due to motion of the point of application affect 
stability, but not equilibrium. 

Next consider a system of particles, connected or unconnected. It 
would be possible to test the equilibrium of each particle separately 
by a calculation of the work done in an arbitrary displacement. Since 
in such a calculation the forces, if any, between the particles them¬ 
selves would have to be included, the method would offer no practical 
advantage. However, suppose all the displacements considered to be 
infinitesimal and such that they do not violate any of the rigid con¬ 
nections or Motionless constraints of the system: then it is easily 
shown* that the forces corresponding to such connections or constraints 
do not enter into the expression for the virtual work of the system 
as a whole. 


♦ See, for instance, p. 167 of Ref. 28. 
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The time variable does not enter into the discussion of static 
equilibrium. Any given configuration of the system is assumed to be 
completely specified by the m generalised coordinates q, and our object 
is to decide whether that particular configuration satisfies the con¬ 
ditions of equilibrium. Suppose *1. *3 to represent the com¬ 

ponents of a force applied at the point whose coordinates are x v *.»*s 
and let X, x denote the corresponding columns. Then the total work 
done in the displacement is given by 

SW = S 0 X'Sx, 

where S 0 denotes summation extending to all the particles and applied 
forces. If the virtual displacement given corresponds to increments 
Sq { of the generalised coordinates, then by (8-9*1) 

m 3 f 

to - 2 Sq<w~, 

<-l v<li 

and the total work can accordingly be expressed as 

SW = P'8q= (Sq')P, 

where Sq denotes the column of the increments 8q t , and P represents 
the column of the quantities 

P-YX'V 
* ~ 0 dq, 9 

For equilibrium S W must be zero for all possible virtual displacements, 
i.e. for all ratios of the increments dq t . The condition for equilibrium 
is thus p __ o 

It is usual to call P f the generalised component of force (or more briefly 
the generalised force) corresponding to q if but it should be noted that 
P i has not necessarily the physical dimensions of a force. For instance, 
if q f denotes an angle, P { has the dimensions of a moment. 

8-11. Conservative and Non-Conservative Fields of Force. 

Certain of the forces applied to the system may have the distinctive 
property that the work done by them when the system is given a 
displacement depends solely on the initial and the final configurations. 
Such forces are said to be conservative. 

Suppose the generalised coordinates q to have the values Q for some 
datum configuration. Then the work done by the conservative forces 
when the system is displaced from q to Q will be a definite function V 
of the coordinates q which is termed the potential energy. It follows 
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that the work done by such forces in an arbitrary infinitesimal dis¬ 
placement 8q is given by 8W = — (8q f ) ||^| , where ||~| denotes the 

column . Hence if the part of the typical generalised 

8q% oq m ) 

force arising from the non-conservative forces be P iy then the total 
generalised force will be given by 


Pi-Pi- 


dv 

d 9i ' 


.(i) 


If the field of force is wholly conservative, then the conditions of 

equilibrium are j|~J = 0. These require the potential energy to be 

stationary. If further the potential energy is a minimum for the 
equilibrium position, this position is stable, f 

An important special case is that in which V is a homogeneous 
quadratic function of the coordinates q. The potential energy will be 
expressible in this way when the values of the coordinates appropriate 
to the position of equilibrium under zero external load are chosen to 
be zero and the displacements from this position are always small. Then 

V-WEq, 

where E is a symmetrical square matrix. Hence (compare (2*8-3)) 

i) -. (2) 

so that the conditions of equilibrium are briefly 


Eq = P. 

In systems where aerodynamical forces play a part, it usually happens 
that, while these forces are not conservative, yet they are linearly 
expressible in terms of the displacements by means of aerodynamical 
derivatives.\ Suppose the contribution of these forces to P to be 
written — Wq+w, and let P* be the part of P due to the remaining 
forces, if any. Then the typical condition of equilibrium is 


Cq = w+P *, 

where (7 s W + E. The square matrix G (which is not in general sym¬ 
metrical) is termed the stiffness matrix , and its elements are the 
stiffness coefficients , or simply the stiffnesses. Each stiffness is the sum 
of two terms, namely an aerodynamical stiffness which originates from 
the air forces, and an elastic stiffness which actually represents the 


t See, for instance, § 86 of Ref. 29. J See also § 9*6. 
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influence of all the conservative forces. Provided 0 is not singular*]* 
the last equation may be premultiplied by ® s C - * 1 to give 

q = <b(w + P*). 

The matrix <J> is called the flexibility matrix , and its elements are the 
flexibility coefficients , or the flexibilities.% It will be noted that w 
represents the generalised wind forces appropriate to the configuration 
for which the coordinates q are zero. 


Examples 


(i) Reciprocal Theorem for a Conservative System . Suppose any two 
external load systems ^(1) and *P(2) to be applied, and let the corre¬ 
sponding displacements be #(1) and q(2). Then the conditions for 


equilibrium after either displacement are — 
(2), since E is symmetrical, 


dV 

9 ?i 


= 0 . 


Hence by 


?'(1)TO =q'(\)Eq[2) = q'(2)Eq(l) = <?'(2)$(1). 

These reciprocal relations are valid for any number of degrees of 
freedom, and the theorem is in fact true for a continuous elastic body. 

(ii) Principal Directions of Loading at a Point of an Elastic Body or 
Structure. A principal direction at a point P is such that a load applied 
to P in that direction displaces P in the same direction. Let unit 
force applied in the direction Ox x give component displacements 
8 n , 8 21 , 8 31 : and similarly, let the displacements due to unit forces 
parallel to Oa? 2 , Ox% be respectively 8 12 , 8 22 , 8 32 and 8 13 , 8 23 , 8 33 . By 
Hooke’s law a force with components X v X 2 , X z will produce a dis¬ 
placement whose components d v d 2 , d 3 are given by {d J = [8 i; ] {XJ, 
or say d = 8X. Now for a load in a principal direction we require 
d = AX, where A is a scalar multiplier: hence 


(AI-8)X = 0. 

It can be shown that the principle of the conservation of energy will 
be violated unless 8 is symmetrical. For example, if the point P be 
made to describe a closed rectangular path with its sides parallel to 
Ox x and Ox 2) then the whole work done in the cycle of displacement 
will not be zero unless 8 12 = S 21 . It then follows from the orthogonal 
property of the modal matrix of a symmetrical matrix (see example 


t When C is singular there is no position of equilibrium unless w + P* = 0, in which case 
the system is neutral. 

J The flexibilities are also referred to as influence numbers in the theory of beams. 
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(v) of §3*8) that there are three mutually perpendicular principal 
directions at any point of the structure. The proof given shows that 
this conclusion follows merely from Hooke’s law and the principle of 
the conservation of energy. 

8*12. Dynamical Systems. By a dynamical system is meant 
one or more rigid bodies, which, ideally, may be mere particles. The 
bodies may be quite independent, or they may be subject to constraints 
of the types already discussed in Part I of this chapter. They may 
also influence one another by direct communication of momentum by 
impact, or by communication of momentum at a distance by gravi¬ 
tational or electro-magnetic forces. In the last category may also be 
included the effects of ideal massless elastic links, and—with certain 
restrictions—the reactions of a perfect fluid in which the bodies are 
immersed. 

A continuous elastic body is not a dynamical system according to 
the foregoing definition. Nevertheless it is possible to consider such 
a body as the limit of a dynamical system having an indefinitely large 
number of particles with ideal massless elastic links between the 
particles. Moreover, it is often convenient to suppose a continuous 
elastic body to be replaced by a body having only a finite number of 
definite modes of deformation. Such a body is said to be semi-rigid : it 
is a true dynamical system possessing a finite number of degrees of 
freedom. For example, a semi-rigid cantilever beam capable of bending 
according to a given law of flexural curvature can be imagined as a 
system of infinitesimal rigid rods so geared together that their dis¬ 
placements accord with the given law; such a semi-rigid beam would 
have one degree of freedom. Naturally a semi-rigid body will not 
behave in the same way as its elastic counterpart in all circumstances, 
but its behaviour may sometimes be exactly the same, provided that 
the modes of distortion are properly chosen. The chief value of the 
conception of the semi-rigid body is that it permits approximate 
calculations of reasonable accuracy in cases where exact treatment is 
not mathematically feasible. For instance, the differential equations 
governing the oscillations of an aeroplane wing moving through the 
air are extremely complicated, but the stability of the wing can be in¬ 
vestigated approximately on the supposition that the elastic wing is a 
semi-rigid body possessing only two degrees of freedom corresponding 
to flexure and torsion.* 


* Seep. 7 of Ref. 30. 



8*13 


MOTION OF AN AEROPLANE 


267 


8*13. Equations of Motion of an Aeroplane. Before con¬ 
sidering dynamical systems comprising several bodies, we shall express 
in matrix form the equations of motion of a single rigid body. The 
method of construction of the equations is quite general, but for the 
sake of definiteness we shall suppose the body to be a (rigid) aeroplane 
in flight. The notation differs in some respects from that which is 
standard in aerodynamics,* but the necessary scheme of conversion 
will be stated at the end of the section. 



In Fig. 8-13-1 the frame 0(x v x 2 ,x 3 ) is fixed relative to the earth 
with Ox 3 vertically downwards, while Ox v Ox 2 , Ox 3 are body axes 
through the centre of mass 0 of the aeroplane. The reader should 
imagine that he is viewing the aeroplane in the diagram from below. 
The axis Ox x is longitudinal and directed forwards, Ox 2 is drawn 
laterally to starboard, while Ox 3 is in the plane of symmetry and 
downwards. The six parameters used to describe the position of the 
aeroplane at any instant are to be regarded as the three rectangular 
coordinates of 0 referred to the fixed axes, and the three angular 
coordinates <f> v <j> 2 , <j> z defined in § 8-3. 

Let m denote the total mass of the aeroplane, and let mG, ml 
denote, respectively, the columns of the components of the weight 
and of the total remaining applied forces, measured in the directions 


♦ Chap, v of Ref. 27. 
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of the body axes. Al so let L be the column of the moments about the 
same axes, and u, p the columns of the components of the linear and 
the angular velocities. 

Applying (8*7*4) and (8*7*7), and remembering that 0 is the centre 
of mass, we obtain the six equations of motion in the form 


( ID+w)u = X+G , .(1) 

(ID+m)Jp = L, .(2) 


where m and J are as defined in §§ 8*6 and 8*7, respectively. 

The expression for O in terms of the angular coordinates* is 
given immediately by equations (8*4*1) and (8*6*3). Thus, if g is the 
acceleration due to gravity, 

mO = J{0,0 ,mg}, .(3) 

or Q = gn, 

where n x = — sin0 2 ; n 2 — cos <j > 2 sin n 3 — cos <j > 2 cos <f> v 


Lastly, the angular velocities p are themselves given by 

p = R<i>, W 

where R is as defined in § 8-6. 

As a special case assume the aeroplane to be in steady motion. Then 
u, p, X and L are columns of constants, and equations (1) and (2) 
reduce to mu = X+gn, .(6) 

mJp = L. (0) 

Prom (5) it follows that n is a column of constants, so that <f > 2 and <f> x 
are constant. Equation (4) accordingly simplifies to 

p = Cln, (7) 

where also is a constant. Hence in steady motion 


p'p = ilhi’n = Q 2 . 


The simplest type of steady motion, of course, is that of steady 
rectilinear symmetrical flight. Then with £1 — 0 and <j> 1 — 0, the 
conditions (6) and (6) reduce to 

X = ?{sin <f> 2 , 0, cos0 2 }, 

L = 0. 


* In relation to the present problem the angular displacements <t> 9 , <f> 3 , 4> x represent 
rotations in yaw , pitch and roll, effected successively about the carried positions of the 
body axes. 
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Addendum. The scheme of conversion to standard aeronautical 
symbols is as below: 


Matrices 

Used 

x 

u 

X 

L 

P 

m 


J 


Equivalent in 
Standard Symbols 

{x,y,z} 

{u,v,w} 

... {.X,Y,Z} 

... {L, M, N} 

{V,q,r} 

... {*,*.*} 

... T 0 — r q 

r 0 —p 
-q p 0 . 

- A -F -E~ 
-F B -D 
-E -D C 


8*14. Lagrange’s Equations of Motion of a Holonomous 
System. Suppose x to represent the Cartesian coordinates, referred 
to fixed axes, of a typical particle of mass m. The system will be 
assumed holonomous, so that these coordinates will be expressible 
in terms of the generalised coordinates q ( , and possibly also the time 
variable t, by the geometrical equations (8*9-1). If X denotes the 
column of the components of force acting on m, then the equations 
of motion for the particle are expressed by 

m£ = X. .(I) 

This equation states that the applied force is in equilibrium with the 
reversed effective force.* 

Premultiply equation (1) by the row matrix 

dr rdf, du 34-1 

L3?i’ 9?<J 

and sum for all particles of the system. Then 

= .< 2 > 

where E 0 denotes summation for all the particles. 


* In engineering it is common to speak of the reversed effective force as the inertia force. 
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Consider, firstly, the expression on the right of this equation. This 
is formally identical with the quantity P i which was derived in § 8*10 
by an application of the principle of virtual work-and which was 
termed the generalised component of force, provided it is understood 
that in the virtual displacements of the dynamical system the time 
variable t (which may enter explicitly into the geometrical equations 
of the system) is not varied. Forces which do no work* in the type of 
displacement contemplated, and which therefore do not appear in p^., 
include forces between the particles of a rigid body, pressures or 
tensions in inextensible connecting members, the reactions at fixed 
pivots or fixed smooth (or in some cases perfectly rough) guides, and 
lastly the reactions at any smooth guides which provide kinematic 
constraint and are moved in some prescribed manner. These last 
forces, in particular, will be absent from P i3 since the guides concerned 
are not moved in the virtual displacement. 

The expression on the left of equation (2) represents the appropriate 
generalised component of the effective forces on the system. In 
Lagrange’s method it is very conveniently expressed in terms of the 
kinetic energy function. The essential step is that the total kinetic 
energy of the system, namely 

T = liQ^mx'db, .(3) 

must first be expressed as a function of the generalised velocities g, the 
generalised coordinates g, and the time variable t. This can be done 
immediately by use of the relations (8*9*2). The resulting expression 
for the kinetic energy will clearly be quadratic (though not necessarily 
homogeneous) in the quantities, so that, say, 

T = \q'Aq + q'A + A Q . 


The symmetrical square matrix A , the column matrix A, and the 
scalar A 0 depend in general on the coordinates q and on t, but if the 
geometrical equations do not contain t explicitly A and A 0 will be 
absent. In this special case T will be a homogeneous quadratic 
function of the variables g, and its coefficients will depend only on 
the coordinates q. 


Now by (3), || - 


But equations (8*9*2) give on differ¬ 


entiation 


dq t ~ dq t ’ 


where the generalised velocities are treated for 


* Forces which do no work are sometimes referred to as eonstrainls. 
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the present as distinct variables, independent of the generalised 


coordinates. Hence 


dT V 0 / 

x-r = L Q m±' 
dq t dq t 


Again, since obviously the differential operators %(== J- + 2 

dt\ dt i«i dqj 


and ^ are permutable, it follows that 


d(dT\ _ y df di _ 

dAjdq} ~ ^ + ^ - S 0 m^i5+^. 


Hence finally 




a/ , /tl= d/arxsr 

' \0?J 


dt 


0 ?< 


Lagrange’s equations of motion are thus derived in the form 

d 


dt 


ma-r. 

10?<J I3?ti 


•(4) 


If, as in § 8-11, F denotes the potential energy of the conservative 
applied forces, and P is the contribution to P due to the non-con¬ 
servative forces, then (4) may be written 

‘"MS-* 


.(5) 


dt \dqji 

in which L=T —V. The quantity L, namely the difference of the 
kinetic and potential energies, is referred to as the Lagrangian function 
or the kinetic potential. 

Examples 

(i) Expanded Form of Lagrange's Equations. For simplicity assume 
the constraints to be independent of time. Then if, for instance, there 
are two degrees of freedom, 

2 T = A u q\ +-4 8 2?a + 2.4 
dT 

and on differentiation = A n q l +A u q it bo that 
d ldT\ dT ... . _ /. dA n . dA n \ . 


+ 




0-^18 . • 0-^18 
0?1 ?a 0?8 , 




3^22 •* • • \ 


On rearrangement this gives the first dynamical equation in the 
explicit form 

a » a •• ldA n /&4 12 13^22\ •» 3*4ii • • D 
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Similarly, noting that A ia — A tl , we have 

A A- I A fj I /^21 1 ^ll\ -2 | 1 ®-^22 -a . 3-^22 rf _ p 

More generally, if there are m degrees of freedom, the rth dynamical 
equation will be m mm 

2 -4*,?<+ S S = P r , 


i-ij-i 


in whichf 


o n(r)-dAir dA jr dA {J 

20! »-^ + 8 * 9 ,, • 


(ii) Discriminants of the Kinetic Energy Function . In the case of a 
holonomous dynamical system with constraints independent of time 
the kinetic energy function is a homogeneous quadratic function of 
the type T = \q'Aq. Since the kinetic energy is necessarily positive, 
it follows that I 7 is a positive quadratic form, and that its discriminants 
are all necessarily positive (see §1*16). It should be noted that no 
discriminant of the kinetic energy can vanish, unless the system has 
the unusual property that all its particles can be at rest while some of 
the generalised velocities are not zero. 


8*15.* Ignoration of Coordinates. It sometimes happens that 
certain of the generalised coordinates do not appear explicitly in the 
Lagrangian function of a conservative system though the corre¬ 
sponding generalised velocities are present. Such coordinates are said 
to be ignorable.% 

It will be convenient to suppose that the system has m non-ignorable 
coordinates q v q 2 > an d in addition k ignorable coordinates 

•••> 7 hc Then by definition the function L = T—V is explicitly 
dependent only on the sets of quantities q , q and rj. If the constraints 
are independent of time, the kinetic energy will be a homogeneous 
quadratic function which can be represented by means of partitioned 
matrices as (see example (ii) of § 1-15) 

T = HM] [«. fl] |?1 - lq'«4 + V'ri+W8v .(1) 

Lr. 

The symmetrical square matrices a and 8 are of the orders m and k 
respectively, while J3 and y are respectively of the types (ra, k) and 

f The quantity C$ is sometimes denoted by a “Christoffel’s symbol** jjQ, hut this 

symbol is here avoided as it might lead to confusion with matrices. 

i Other terms in use are cyclic, absent or kinosthenic . See § 38 of Eef. 31, and § 84 of 
Ref. 29. 
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(k, m) with Y = /?. All these matrices are independent of the co¬ 
ordinates 7J. 

The k equations of motion appropriate to the ignorable coordinates 


are 

give 


here .imply |{^J 


= 0. These can be integrated immediately to 
(dT) 

fei y ® + ^ =or> . (2) 


where <r denotes a column of k constants of integration. The velocities 


ij can now be expressed in terms of q and q . Thus 

rf = S- 1 # - S-^q, .(3) 

or, alternatively, since S is symmetrical and y' = /?, 

rj' = cr'd-'-q'fiS- 1 . .(4) 


Next consider the m equations of motion appropriate to the non 
ignorable coordinates, namely 


d IdT 


dt [dq^ 




(5) 


These equations contain explicitly not only the non-ignorable 
variables but also the quantities rf and i). However, by means of (3) 
and (4) it is possible to eliminate rj and i), and so to derive a set of m 
differential equations involving only the non-ignorable coordinates. 
In this way the dynamical system is effectively reduced to one having 
only m degrees of freedom. We shall now show how the equations 
appropriate to the reduced system can be constructed directly. Let 


R = T 1 -r/'<r-V , .(6) 

where T x is the function T expressed in terms of the variables q and q, 
and rj is expressed in terms of the same variables. Then 


\m = [dT 1 \rdrn 

Wi) l3?iJ l_3?i_ 


<r. 


where 



is an (to, k ) matrix. But 


Hence by (2) and (7) 



( 7 ) 
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Similarly, 



The required reduced form of the equations (5) is accordingly 

UJ . () 

The explicit expression for R in terms of q and q is readily found as 
follows.* By direct substitution for Srj from (2) in (1) and use of (6) 
and (4) we hare 

jB + V = iq'aq + \rj'yq - \if<r 

= &'«? + Her 1 *- 1 - q'fiS’ 1 ) (yq - cr). 

Hence JB = lq'(a — 2+ q'P8~ x <r— \<r'8~ x cr— V. 

It should be noted that although equations (8) are similar in general 
form to Lagrange’s equations, yet the function R is not constructed 
in the usual way as the difference of a kinetic energy function and a 
potential function. It may contain terms which are linear in the 
velocities q. 


8*16.* The Generalised Components of Momentum and 
Hamilton's Equations. The kinetic energy of a single particle is 
T = Jmir'ir, and the three components of momentum, say p v p t , p 3 , 
are given by 

'-•"HU- .<*> 


Further, the kinetic energy, when expressed as a function of the 
components of momentum, takes the form 





( 2 ) 


while the equations of motion of the particle (see (8* 14*1)) become 
p = X in conjunction with (1). These equations illustrate in a very 
simple way Hamilton’s equations of motion, which make use of the 
momenta—or in effect the generalised velocities—as auxiliary depen¬ 
dent variables. The construction of Hamilton’s equations for a 
holonomous dynamical system will now be considered. 

By analogy with (1) the m generalised momenta for a dynamical 
system are defined by 






8*16 THE HAMILTONIAN FUNCTION 275 

If the kinetic energy function is 

T = lq'Aq + q'A + A 0 , .(3) 

then p = Aq + A. .(4) 

When, as is normally the case, the symmetrical matrix A is non¬ 
singular, the preceding equation gives by inversion . 

ff = A~\p-A). .(5) 

By the use of (5) and its transposed the kinetic energy (3) can be written 
explicitly as a quadratic function of the variables p . Thus, if the 
function when so expressed is distinguished as T v we have 

T x = W-A')A~\p-A) + {p'-A')A-'A+A» .( 6 ) 


which is the generalisation of (2). 

In Hamilton’s equations the generalised momenta p and the 
coordinates q are used as dependent variables instead of q and q. 
Moreover, the Lagrangian function L is replaced by the Hamiltonian 
function H defined as 


H 


q'p-T+V. .(7) 

As with the kinetic energy, the function H may be expressed either in 
terms of p or of q. In the first case, if (5) and (6) are used in conjunction 
with (7), 


H = \{p'-A’)A-\p-A)-A {s +V. 
Similarly, if (3) and (4) are used in conjunction with (7), 


.( 8 ) 


H = T — q'A — 2A 0 + V. 

From the last equation it is seen that if the constraints are independent 
of the time (so that ^1 = 0 and A 0 = 0), then H reduces to the total 
energy T + V. 

To obtain the dynamical equations in Hamilton’s form, we note 
firstly that the Lagrangian equations (8-14*5) may be written 



But by (3) 


dT 


,.,dA. dA 


SAo 


which on application of (4) and use of the relation 

dA dA- 1 









276 EQUATION OF ENERGY 8*16 

may be expressed as* 

I - -Ur'-A') d -£<r-AMr'-A 

. (9) 

Hence the first set of equations isf 

. (io > 

The second set of equations, obtained by differentiation of (8) and use 
of (6), is /gm 

wr A ^*- A)= * . (11) 

The relations (10) and (11) jointly are the complete Hamiltonian 
equations. 

Examples 


(i) Reciprocal Property of a Dynamical System . If the kinetic energy 
is a homogeneous quadratic function of the generalised velocities, then 
by (4) p = Aq. Suppose q( 1) and q( 2) to represent two different sets 
of generalised velocities of the system in one of its configurations, and 
let p(l) and p(2) be the corresponding momenta. Then since A is 
symmetrical 

q'(\)p{2) = g'(lM2(2) = 3'(2M?(1) - g'(2)p(l). 

This reciprocal property should be compared with another given in 
example (i) of §8-11. 


(ii) Equation of Energy. If the constraints are independent of the 
time and the system is conservative, (10) and (11) give 


S (J,+ r >- 


dH 

dt 



- v'i-i'v - o. 


Hence for such a system the total energy is constant. 


* If, as is usual in aerodynamical applications, the generalised forces contain parts P 
dependent on the generalised velocity components then these parts must be expressed in 
terms of the variables p, q and t by use of (5) before they are introduced in the Hamiltonian 
equations (10). 

dT 

f The reader should be careful to note that in the differentiation on the left of equa- 

cqt 

dH 

tion (0) T is expressed as a function of q and t, whereas in the differentiation on the 

<7g| 

right of the equation, H is a function of p, q and t. 
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8*17.* Lagrange’s Equations with a Moving Frame of 
Reference. In some applications it is convenient to refer the displace¬ 
ments of the system to a base which is itself in motion. This moving 
frame of reference (not necessarily fixed in any body of the system) 
will as usual be denoted by 0(x v x 2 , x 3 ), and the Cartesian coordinates 
of a typical point P relative to the frame will be represented by x . 
Then in place of the geometrical equations (8*9*1) we shall have for 
the typical point . 


The coordinates of P referred to a fixed base 0(x x , x 2i x 3 ) will as 
previously be denoted by x, while v and a represent respectively the 
components of the total velocity and total acceleration of P, measured 
in the directions of the moving axes. Finally, u denotes the velocity 
components of 0 , measured in the same directions. The actual expres¬ 
sions for v and a in terms of u , x and of the angular velocities p of the 
moving base are given by equations (8*7*1) and (8*7*2). 

Now consider the respects in which the analysis of §8*14 requires 
modification to allow for the motion of the base of reference. The 
equations of motion of the typical particle P may in the present case 
be taken as ma = X , where X represents the column of the components 
of force in the directions of the moving axes. A treatment similar to 
that adopted with (8*14*1) leads to an equation which it will be con¬ 
venient to write as w o f 

So m^ a = So Z'-l. .(1) 

The quantity on the right is as before denoted by P { , and is referred to 
as the generalised force corresponding to q { . If the system is given an 
infinitesimal virtual displacement involving a variation Sq t of g* only, 
the work done by the external forces will be P^q^ In this hypothetical 
displacement the frame of reference 0(x v x 2 , x 3 ) is regarded as fixed. 

To evaluate the quantity on the left of equation (1), first substitute 
for a from (8*7*2). The result is somewhat cumbrous, but it can be 
reduced conveniently by the introduction of a centrifugal potential 
function defined by 

V 0 = l'Z 0 mx'm 2 x + T lo mx f mu + 'E 0 mx'u. 

Partial differentiation of this function yields 


3F 0 

dq d 


S 0 m| f -N, 
d 9i 


N = m 2 x -|- mu -f u. 


where 
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If further 
and 


4 = 


S 0 


3 /' . 

m-J-vjx 

3?i 



WXy 


the left-hand side of (1) can be written 


§5 + 7 <+ 0, +S ,w|£f. 

The last term in this formula is expressible in terms of the kinetic 
energy of the motion relative to the base—namely, 0 tnx'x —by the 
method of § 8-14. The quantity represents an inertia effect dependent 
on the displacements, but it cannot be derived from a potential since 
the matrix m is not symmetrical. Finally, the quantity 0 { depends on 
x and is therefore a linear function of the generalised velocities. If the 
coefficient of q i in O i is denoted by O ijy then 

(? « = 2S 0 m |;f; 

or on substitution for m from (8-5-4) 


hi = -2S 0 m^ 


' 9(/ a ./ 3 ) . _ 9(/a-/i) . _ 
Pl dM +P *W ^) +p3 


3 (ffi,?,)/ 


Hence O it = —Q }i , and in particular O u = 0. The quantities O i} are 
spoken of as the gyrostatic coefficients, and it is to be noted that on 
account of the properties just stated they do not appear in the expres¬ 
sion for the total energy. 

When the centrifugal potential V 0 is a homogeneous quadratic 
function, say V 0 = \q’<rq, the coefficients <r w are referred to as the 
centrifugal stiffnesses. They play exactly the same part as the gravi¬ 
tational and elastic stiffnesses. 


Example 

Suppose O to be always at rest, and let p t = p 2 = 0, while p a — £2 
(a constant). Then 

^, = Z 0 $ma;TO, -12, CfJ 2 a; = -$£2 2 2 0 m(a;f+«$). 

£ 2 , 0 , 0 
.0, 0, 0. 
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The typical generalised component of centrifugal force is thus 
3 F o~oav m ( r 9 /i 9 M 

~W< ~ s " m r‘¥,‘¥,)' 

and the typical centrifugal stiffness is 






= -Q 8 E 0 m 


8 *A ,/ 5 *A \ 
\3?< fyj 1 9?< W 


Finally, the typical gyrostatic coefficient is 


and J( — 0. 


Q i} = — 2f2S 0 m 


3(/i,/ a ) 

%<•?>) 
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CHAPTER IX 


SYSTEMS WITH LINEAR DYNAMICAL EQUATIONS 

9*1. Introductory Remarks. The present Chapter deals with 
motions governed by linear ordinary differential equations with con¬ 
stant coefficients. The language of the dynamics of material systems 
will be used throughout, but the treatment can, for instance, be applied 
equally well to electrical systems. The discussion and exemplification 
of approximate numerical methods of solution is reserved for Chapter x. 

9-2. Disturbed Motions. Except with very special systems or 
types of motion the differential equations which arise in dynamics are 
non-linear and do not admit exact solution. It is, however, sometimes 
possible to obtain particular solutions, such as those corresponding 
to equilibrium or steady motion. Then, if the system is supposed to 
be slightly disturbed from this known condition, the resulting small 
motion of deviation will be given by a set of linear differential equations. 
In the special case where the undisturbed state of the system is one 
of equilibrium or steady motion, the equations of disturbed motion 
will have constant coefficients and will be soluble by the methods of 
Chapters v and vr. In more general cases the equations will have for 
coefficients given functions of time, and they will thus be of the types 
considered in Chapter vn. 

The disturbances just referred to may be of two kinds. They may 
be merely temporary, and represented by a set of initial conditions 
of motion which differ slightly from those corresponding to the un¬ 
disturbed motion. The motion of deviation is then said to be free. On 
the other hand the disturbances may consist of small persistent forces 
which vary in some assigned way with time. In this case the motion 
of deviation is said to be forced . 

The usual method of construction of the equations of deviation is, 
briefly, as follows. Let the set of values of the generalised coordinates 
corresponding to the undisturbed state of the system be represented 
by Q: these values are to be regarded as assigned functions of t (or 
possibly given constants). In the disturbed motion let the deviations 
(i.e. the increments of the generalised coordinates) be q. Then to form 
the equations of disturbed motion substitute in Lagrange’s equations 
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the values Q+q for the coordinates, Q + g for the velocities, and so on 
neglect all terms involving products of the quantities q or of their deri- 
vatives, and introduce the persistent disturbing forces, if any. In this 
way a set of linear differential equations in the deviations q is derived. 
These equations are, of course, only valid for the representation of 
deviant motions which are “small’* in the sense implicit in the 
approximations used. 


9*3, Conservative System Disturbed from Equilibrium. The 

simplest problem of the class just considered is that in which a con¬ 
servative holonomous system with constraints independent of time 
receives a small disturbance from equilibrium. If, as will be assumed, 
the generalised coordinates are measured from this position of equi¬ 
librium, the quantities Q will all be zero. 

Now use the expanded form of Lagrange’s equations (see example (i) 
of § 8-14), and reject all the terms involving products of the deviations 
q or their derivatives. If there are no permanent disturbing forces, the 


equations are simply 



where the elements of A are all constants. Further, since q = 0 is a 
position of equilibrium, the terms which are linear in q will be absent 
from V, so that this function may (to the order of approximation 
considered) be taken as y — e + \qfEq 


Hence the required equations of free disturbed motion are 


Aq + Eq = 0, 


(1) 


in which A and E are both symmetrical matrices of constants. 

If the disturbed motion is forced, the equations are of the more 
general type Aq+Eq = 


in which g(Q represents the column of the disturbing forces. 

The principal diagonal elements of A, which obviously play a part 
analogous to ordinary moments of inertia, may for convenience be 
spoken of as the generalised moments of inertia . For a similar reason it 
is natural to refer to the elements of type A {j (i+j) as generalised 
'products of inertia . Finally, the elements E xi are described as the 
direct elastic stiffnesses , while those of type E {j (i+j) are called elastic 
cross stiffnesses. 




282 


DISTURBED STEADY MOTION 


9 - 3-94 


Example 

Transformation of the Coordinates . In the discussion of the Lagran- 
gian equations of small motions it is sometimes convenient to transform 
the m generalised coordinates q to a new set, say q. Suppose the two 
sets to be connected by the linear substitution q = uq 9 where u is a non¬ 
singular square matrix of order m with given constant elements. Then 
T = \q’Aq = 

and V = E 0 + \q'u'Euq. 

The equation corresponding to (1) is accordingly 

(u'Au)(i + (u'Eu)q = 0 . 

It should be noted that u'Au and u'Eu are both symmetrical. 

9-4.♦ Disturbed Steady Motion of a Conservative System 
with Ignorable Coordinates. The state of steady motion considered 
is that in which all the non-ignorable coordinates, and the generalised 
velocities corresponding to all the ignorable coordinates, have con¬ 
stant values. It has been shown in §8-15 that when a system has 
ignorable coordinates the dynamical equations are reducible to a 
simpler set which involves only the non-ignorable coordinates. The 
final equations are similar in general form to Lagrange’s equations, 
but the Lagrangian function L is replaced by a function R which may 
contain terms linear in the generalised velocities. It is the presence of 
these linear terms in the “modified” Lagrangian function R which 
distinguishes this problem of disturbed steady motion from that of 
disturbed equilibrium.f 

As in §8*15 we shall suppose that there are m non-ignorable co¬ 
ordinates q y and k ignorable coordinates y. Then if the kinetic energy 
function is represented by 

T = W<x4 + y’y<i + Irj'Srj, 

the k dynamical equations appropriate to the ignorable coordinates 
give on integration (see (8*15*2)) 

yq-\-8tf = o', .(1) 

while the remaining m equations for the system in its reduced form are 
(see (8*15*8)) 


in which R = \q'(a - fiS^y) q + q'fiS-'cr - £ <r'8-i<r - F. 

t Compare p. 196 of Ref. 31. 


dt\dqj (dg'J ~ ’ 
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In steady motion the velocities rj are all constant. Moreover, the 
non-ignorable coordinates also have constant values which, without 
loss of generality, may be taken to be zero. The quantities q then 
represent the deviations of the non-ignorable coordinates in the 
disturbed motion. Again, corresponding to the constant values of rf 
and of q in the steady motion there will be a set of constants given by 
(1). These same constant values for cr are assumed to hold good for the 
disturbed motions to be considered.! 

To derive the equations of free disturbed motion, neglect all terms 
in the expanded form of R which are above the second degree in the 
variables q and q. To this order of approximation R is quadratic in the 
variables q and q. It will be convenient to write 

in which now 51, 93, ©, etc. are all matrices of constants. The square 
matrices 91 and (£ are symmetrical, but 93 is in general not symmetrical. 
Then 

\w} = ^' + ® and = + 

Hence equation (2) becomes 

2lg + (93 - 93')?+6?-3f = 0. 

Since this is satisfied by q = 0 , we require g = 0 . Hence finally 
2t?>(93 - 93')<Z + (£<Z = 0. 

This equation is of a rather more general form than that corresponding 
to disturbed equilibrium, in that it contains terms dependent on the 
generalised velocities. The elements of the skew symmetric matrix 
93 — 93' are spoken of as gyrostatic coefficients (compare also § 8*17). 

9*5. Small Motions of Systems Subject to Aerodynamical 
Forces. The aerodynamical forces acting on a system will not, in 
general, be conservative, and will, strictly speaking, depend on the whole 
previous history of the motion. When such a system is disturbed from 
a known condition (e.g. equilibrium or steady motion) it is usual to 
assume as an approximation that the differences between the values 
of the air forces in the disturbed and undisturbed states of the system 
depend linearly on the sets of quantities q , q, q, at most, where q 
denotes the deviations of the generalised coordinates. Hence, if 
and represent respectively the typical generalised aerodynamical 

t This implies that the disturbances do not introduce generalised forces corresponding 
to any of the ignorable coordinates. 
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forces for the disturbed and undisturbed states, then according to the 
foregoing approximation 



The constant coefficients may be referred to as the 

dq j d 9j d qj J 

generalised aerodynamical derivatives . If the column of the generalised 
aerodynamical forces is expressed as 

= y^ — Wq — Bq — Aq, 


then 




V « Sq/ Aii dq } ' 

To obtain the equations of disturbed equilibrium of an aerodynamical 
system the method of § 9-3 is followed, but the generalised forces 
are included. The final equations may be written 


(A + A)q + Bq + (E + W) q — £(t). 

The elements of A, with their signs reversed, are called the acceleration 
derivatives. They are generally small, and are either treated as negli¬ 
gible or are assumed to be of such a nature that they can be absorbed 
into the generalised moments and products of inertia without destruc¬ 
tion of the symmetry of the inertia matrix A. On this understanding 
it is usual to write the last equation as 


Aq + Bq+Cq = £(t), .(1) 

where C = E -f W. The elements of B are described as the damping 
coefficients, while those of W are the aerodynamical stiffnesses. In 
general neither the damping matrix B nor the total stiffness matrix G 
will be symmetrical. The principal diagonal elements of C are spoken 
of as the direct stiffnesses, while the remaining elements are cross 
stiffnesses ; similarly, the principal diagonal elements of B are direct 
dampings, while the remainder are cross dampings. The coefficients 
of type A ip B ip Cy (i+j) are collectively called the couplings. 


9-6. Free Disturbed Steady Motion of an Aeroplane. 

The equations in this case can be deduced directly from the equations 
of general mot 7 m of an aeroplane given in § 8* 13. The symbols have the 
same meanings as before, except that they now represent incremental 
values in the disturbed motion: the corresponding values in the 
given steady motion are denoted by the same symbols with a cipher 
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suffix. As is usual in the treatment of aeroplane stability,* the body axes 
are assumed to be principal axes, so that the inertia matrix J will have 
the diagonal form J = „ Q _ 


The scheme of conversion to standard aeronautical symbols will be 
considered later. 

Equations (8-13-1) and (8-13-2) give immediately for the disturbed 
motion (ID+w + vj 0 )(u+u 0 ) = X + X 0 + O + O 0 , 

(ID + m + m 0 ) J(p+p 0 ) = L + L 0 . 

Omitting the terms which arise only in the steady motion and neg¬ 
lecting squares and products of the deviations, we obtain for the deviant 

motion (ID + m 0 ) u + mu 0 = X + 0, .(1) 

(ID + m 0 ) Jp + xuJp 0 = L. .(2) 

Now (see example (i) of § 8-5) mu 0 = — a 0 p , where 


&0= 0 “”^30 ^20 

^30 0 ~ ^10 

-u 20 u 10 0 

Similarly, m 0 Jp + w Jp 0 = — b 0 p, 

where b 0 = T 0 (J 2 - J 3 )p 30 (J 2 - J 3 ) p 20 ~ . 

(t/3 — J X ) P 3Q 0 (J 3 — Jl)PlQ 

(JjL —J 2 )^20 04 ~ ^PlO ^ 

Again, expressing X and L in terms of aerodynamical derivatives, and 
neglecting acceleration derivatives, we may write 

X = ®nU + <l> 12 p, 

L = 0> 21 w + $ 22 p, 

in which ® n , 0 12 , 0 21 , <P 22 are square matrices the (i,j) th elements of 
which are respectively the derivatives 

dXf dXj 0£ t - dL{ 
du f 9 3 pj 9 duj 9 dpj ’ 

Next, by differentiation of the expression (8-13-3) for the com¬ 
ponents of the weight in a general motion, it readily follows that in 

* See, for instance, Chap, x of Ref. 27. 
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the present deviant motion 

0 = gc 0 <f>, 

where c 0 = [" 0 — cos0 2 o 0 

cos ^20 $io sin ^ 2 o sm ^ 

cos ^20 ^io sin ^20 ^io ^ 

On making these substitutions in (1) and (2) we derive the equations 


(® lx -m 0 -ID)u + (®i 2 +a 0 )p + gc 0 <f> = 0, .(3) 

J-'<!> 21 u + (J-'<I> 22 + J-'b 0 -ID)p = 0.(4) 


It remains, finally, to connect the deviations p with the deviations <j> 
and their rates of change. The necessary relation, which is deducible 
by differentiation of (8-13-4) and use of (8-13-7), may be written 

v “ (R 0 n+nc 0 )<f>, 

where c 0 is as defined above, and R 0 is the matrix in (8-6-6) with the 
values <p 0 appropriate to the steady motion substituted for 0. A rather 
more convenient form is 


^p + (Oe 0 -/D)^ = 0, .(6) 

in which e 0 s — jR^Cq = sec (f > 2o r 0 1 Cf|. 

I COS 2 0 0 


[ 0 sin <f> 20 oj 

The required equations of deviant motion are (3), (4) and (5). They 
may be written concisely by partitioned matrices as 

^{u,p,</>} = r{u,p,<f>}, .( 6 ) 


with 

T as 

t? 

1 

T* 

e 

®12 + °0 

gc o' 



J- 1 ** 

+ b 0 ) 

0 



0 

Ho 1 

Qe 0 _ 


The scheme of conversion to standard aeronautical symbols is as 
given in the addendum to § 8-13, with the simplifications D = E = F = 0. 
dX BX 

The derivatives e ^ c * correspond in the standard notation 

BX BX * ^ 

t° du ’ ~dv' e * C "’ an< * ^ ese are u8uall y abbreviated to X u , X v , etc. 


X v , etc. 


The A-determinant of the system of equations (6)—after extraction 
of a factor A indicating neutrality of the aeroplane with respect to 
the angle r/r —can be expressed in standard symbols as follows: 
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5 o* 
| 


<a> c» 

I I 


1 8 


H 


L »> * 


* f 1 

1 -i c 

S*T|5> I ® 


9 

1 tn 5, 
***• 

N 


# 

+ 

6^ 
i ^ 

*< 

1 

«* 

1 O 

•£ *9 

W o 

nT 

^_ 

+ 

N*h 

^_ 

-f- 

8 ® 
1 


I l 


•^?N Jsffo Saf|o ©“ °' 

Sfh fef|o ® ® 

«Th nfh fefta ® °" 


+ nTN 
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9*7. Review of Notation and Terminology for General 
Linear Systems* The few special problems already considered show 
how linear differential equations with constant coefficients arise 
naturally in dynamics. We shall now briefly summarise the essential 
formulae connected with the solution of such systems of dynamical 
equations.* 

The Lagrangian equations for a linear system having m generalised 
coordinates q v q 2 ,q m will be of the type 

f(D) q m (AD* + BD+C)q = g(t) 9 .(1) 

where A, B, C are respectively the inertia , damping and stiffness 
matrices, and £(t) denotes a column of assigned “forcing” functions. 
In a free motion of the system £(£) = 0. 

The determinantal equation 

A(A)s|/(A)|=p 0 A”+p 1 A n - 1 + ...+p„_ 1 A+p n = 0 

is in general of degree n = 2m, and the n roots (not necessarily all 
distinct) are denoted by A 2 ,..., A n . The constituent solution appro¬ 
priate to an unrepeated root Ay is denoted by 

q = {yeV=i r eV, 

and the column of constants k r is referred to as the rth modal column. 
This column may be chosen proportional to any non-vanishing column 
of the adjoint ^(A,.) of the matrix /(Ay). On the other hand, if A s 
represents a member of a set of s equal roots, the constituent solution 
appropriate to A s is written 

Q = {y*)}e v =^(*)c v . 

In this case the s modal columns relevant to the complete set of roots 
equal to A s may be chosen proportional to any s linearly independent 
columns of the family of matrices 

U 0 (t,X a ) = F(X S ), 

(i) 

u x {t, A.) = F(A a ) + tF( A s ), 


(*-2) fo_lWo 0\ (8—3) 

A.) = F( A.) +(s-l)tF(\„) + ( ’ t 2 F(X e ) +... + t°-'F(A a ). 

The modal coefficients k u (t) will be polynomials in t of degree s — 1 at 
most. 

♦ For the detailed exposition see Chaps, v and vi. 





9 - 7 - 9-8 


REDUCTION TO FIRST ORDER 


289 


When g(t) = 0 the most general solution of equations (1), derived 

as a linear superposition of arbitrary multiples of the constituent 

solutions, is T /JX ., , w 

q = c 1 k 1 (t)e x i i +c 2 k 2 (t)e^+ ...+c n k n {t) eV 

in which c v c 2 , etc. are arbitrary constants, real or complex. This 
solution may be written more concisely as 

q = k(t)M(t)c. 

Here k(t) denotes the (ra, n) modal matrix 

*« = I 


and 


k n (t), 

^2l(0> 

^12^)y 

^ 22 ( 1)9 

••*> ^ln(^) 

•••> ^2 n(0 

_^ml 

(<). 


?5- 

2 

c*. 

1- 

)S 

'eH 

o, 

0- 

9 


o, 

e A «*, 

0 



. °> 

o, 




while c represents the column of the arbitrary constants. 

It is sometimes convenient to substitute for the Lagrangian equa¬ 
tions (1) an equivalent set of equations of the first order,* obtained 
by the use of auxiliary variables. If the generalised momenta p = Aq 
are adopted as the auxiliary variables (see § 8-16), then the complete 
set of equations in the Hamiltonian form is 

= r o, a -' 


.v. 



i 

+ 

O 

■1 



i(*)J 


-C, - BA ~ 

A convenient alternative scheme is to adopt the generalised velocities 
q as the auxiliary variables. The equivalent linear system then is (see 
example (ii) of §5-5) y = U y+ V (t), 


where y = 

V 

; vV) = 

■ 0 ' 

; u = 

• 0 , 1 


3. 


A~m. 


-A-'C, -A-'B 


9*8. General Nature of the Constituent Motions, (a) Real 
Roots. The modal column obtained from each real (simple or multiple) 
root of the determinantal equation of the set of equations (9-7*1) will 

* For special methods of solution of first-order systems of equations, see Part II of 
Chap. yi. 







290 


CONSTITUENT MOTIONS 


9-8 


consist of real elements. The corresponding constituent motion will 
be a subsidence or a divergence according as the root concerned is 
negative or positive. A zero root, if unrepeated, yields a constant con¬ 
stituent representing neutral equilibrium. On the other hand, a 
repeated zero root in general gives rise to a constituent involving 
polynomials in t, and thus to a growing motion. 

'(b) Complex Roots . The modal column derived from a complex root 
is, in general, complex. It is customary in dynamics to associate 
together the conjugate complex constituents obtained from conjugate 
complex pairs of roots, and so to derive equivalent pairs of real con¬ 
stituent motions. Suppose X 8 = p + ioj to be a root of multiplicity 8, 
where i denotes V— 1: then the conjugate complex root X a — p — io 
also has multiplicity s. Hence for every modal column obtained from 
the set of 8 roots equal to A s there will be a corresponding conjugate 
complex modal column derived from the set of s roots equal to A 8 . 
If k 8 (t) = a(t) + ij3(t) and k a (t) = a(t) — ifi(t) are, respectively, the 
modal columns appropriate to X 8 and A fl , and if pe iT is a complex 
arbitrary constant, then the most general real linear combination of 
the two constituents concerned will be expressible as 

q = pel* cos (art-f r) a(£) — pepsin (ojt + r) fi(t). 

In the special case where the roots X 8 and X 8 are unrepeated, the 
elements of a and y? are constants, and then the last equation may be 
written 

q = pertyj cos (cut + e j + r)}, 
where {% e u i] = a -f i/3. 

The motion in this case may be regarded as effectively a single oscil¬ 
latory constituent having an arbitrary amplitude p and an arbitrary 
epoch t. The motion will ultimately die away, or grow to an indefinitely 
large amplitude, according as the real part p of the root X 8 is negative 
or positive.* 

(c) Purely Imaginary Roots, f If damping forces are present in the 
system (i.e. if 15 + 0), the modal column corresponding to a purely 
imaginary root will as a general rule contain complex elements. The 

* The general character of the roots of A(A) = 0 can be ascertained by the use of the 
test functions (see § 4*22 (6)). 

t The important case in which B = 0 and C is symmetrical is discussed separately 
in § 9*9. 
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real constituents appropriate to such a root and its conjugate imaginary 
will thus be of the types considered in (6) with the simplification ji — 0. 
Simple imaginary roots give rise to pure sinusoidal oscillations, while 
repeated imaginary roots will in general yield oscillations whose 
amplitudes grow in proportion to a polynomial in t. 

The dynamical coefficients sometimes depend on one or more 
variable parameters. If the values of these parameters are such that 
one or more of the constituent motions of the system are simply 
sinusoidal, the state of the system is said to be critical , and the para¬ 
meters have critical values . The critical state requires that ± io) shall 
be roots of A(A) = 0, so that 

T n _ i = 0, .(I) 

where T n _ x is the penultimate test determinant (see § 4-22 (6)). When the 
parameters are such that (1) is satisfied, they are either critical ac¬ 
cording to the definition, or else A(A) = 0 has equal and opposite roots. 
The transition from a completely stable state of a system to oscillatory 
instability is always indicated by the vanishing of T n _ v A simple 
illustration of a critical parameter is the critical speed for flutter of an 
aerodynamical system (see § 12*1). 


9-9. Modal Columns for a Linear Conservative System. 

It is a well-known theorem* that if the dynamical equations are of 

the type /(D) q = (AD 2 + E)q = 0, .(1) 

in which A and E are both symmetrical, the roots of A(A) = 0 are 
either real or purely imaginary (i.e. they cannot be complex): they are 
all purely imaginary if the potential energy V = iq'Eq is a positive 
function. 

The usual proof is as follows. Write A 2 = — 2 and 


A m (z) = 


A u z-E llt 

A 2 iZ — E 2V 


A xi z — E liy A lm z — E lm , 0 

A 22 Z — E i2 > •••> A 2Tr i^ E 2m y 0 


Aml^ A m2 z E m 2 , A mm z E mm , 0 

0 , 0 , 0 , 1 

.( 2 ) 

Also, let A m _ 1 (z) be the determinant obtained when the first row and 
first column of A m are erased, A m _ 2 (z) be the determinant formed when 


* For historical notes see p. 183 of Ref. 31. 
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the first two rows and columns of A m are erased, and so on until finally 
A 0 = 1: the degrees in z of these determinants are represented by the 
suffices. Since the discriminants of the kinetic energy function are 
all necessarily positive (see example (ii) of §8*14), the determinants 
A 0 ,A X , ...,A m are alternately positive and negative when z = — oo, 
but are all positive when z = +oo. From example (i) of § 1-17 it is seen 
that when z passes through a root of any intermediate determinant 
of the series, say A i (O + i + ra), then A i-X and A m necessarily have 
opposite signs. Now A 0 has the constant value 1, while the one root of 
the equation A x = 0 is obviously real. Hence for this root A 2 < 0. But 
A 2 > 0 when z = ± oo, so that the equation A 2 = 0 has two real roots 
which are separated by the root of A x = 0. Similarly, it follows that 
A s = 0 has three real roots separated by the roots of A 2 = 0, and more 
generally the roots of A^ = 0 are all real, and separate those of A i+1 = 0. 
Since z = — A 2 , it follows that the 2m roots of the original determinantal 
equation are either real or purely imaginary. 

If the potential energy is a positive function, the discriminants of V 
are all positive. Accordingly the signs of the determinants A 0 (z), A x (z), 
A m (z) are alternately positive and negative when z = 0, so that 
all the roots of A m = 0 are positive. By the same argument as before 
it follows that the roots of A(A) = 0 in this case are all purely imaginary. 

Since the roots of A i (z) = 0 separate those of A m (z) = 0, it is seen 
that if A m (z) = 0 has s equal roots z 8 , then A m _ x (z) has s — 1 equal roots 
z 8 , A m _ 2 (z) = 0 has 5-2 equal roots z 8 , and so on, until finally z 8 is a 
simple root of A m _ s+1 (z) = 0. Moreover, these conclusions are true for 
any other arrangement of the principal diagonal elements in (2). Now 
refer to example (i) of § 1*17. If the two determinants on the left of 
equation (1* 17* 1) contain respectively the factors (z — z 8 ) 9 and (z — z 8 ) 8 ~ 2 , 
and if the first two on the right each contain (z — z 8 ) 8-1 , then obviously 
the non-diagonal minor (there called A 12 ) must also contain the factor 
(z—z^) 8-1 . This shows that if z 8 is an 5-fold root of A m (z) = 0, then 
(z — Zg)*- 1 is a factor of every first minor of A m (z). It follows that if 
A 8 , and therefore also - A 8 , are 5-fold roots of A(A) = 0, then (A 2 - A 2 ) 8 - 1 
is a factor of every first minor of A(A). The adjoint matrix F( A) ac¬ 
cordingly contains the same factor, so that F(\ 8 ) and all the derived 

(8-2) 

adjoint matrices up to and including F(\ 8 ) are null. In this case also 

the matrices U^t, A g ) are all null with the exception of C4-i(^> A a ), which 

( 8 - 1 ) 

reduces to the constant matrix F(X a ). Hence the following important 
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theorem: When a conservative system is disturbed from equilibrium the 
modal columns are (in general) all independent of t even when multiple 
roots of A(A) = 0 occur . 

The exception to the foregoing theorem arises when E is singular 
and A g = — A s = 0 , so that there are in all 2s zero roots of A(A) — 0 . 
In this case A 28 ” 2 is a factor of every first minor of A(A), so that the 
adjoint matrix is of the form A 25_2 G(A 2 ). Here all the derived adjoint 

(2a—2) 

matrices involved, with the exception of F( A 8 ) , are null, and the 2s 
modal columns corresponding to the 2s zero roots are obtained from 

(2a—2) (2a—2) 

C4a —2 an d C4a- 1 > which reduce respectively to F(X 8 ) and (25—1) tF(X 8 ) . 

(2a-2) 

If 1c { is any one of the s linearly independent columns of F(X 8 ) , the 
corresponding two constituent motions are + 6^)^, where a t and 
b i are arbitrary constants. This exceptional case, in which E is singular, 
will hereafter be assumed to be excluded. 

When A,, is a simple root the appropriate modal column k r is given as 
usual by any non-vanishing column of the adjoint matrix J? , (A r ). 
Since the elements of this matrix are polynomials in A 2 , and since A 2 is 
real, the modal column corresponding to A,, is real, and is the same as for 
the root — A r . If there are 8 roots equal to A g , the corresponding s modal 
columns are given by the s linearly independent columns which the 
(«- 1 ) 

matrix F(X S ) necessarily contains. It is easy to see that, as for simple 
roots, the modal columns appropriate to the set of roots X 8 can always 
be taken to be real and equal to those for the roots — X 8 . Hence in 
all cases, if the roots are ± A x , ± A 2 , ± A m , and if k 0 is the square 

matrix of order m formed from the m modal columns appropriate to 
X v A 2 , ..., A m , then k Q also gives the modal columns appropriate to 
Aj, A a , • ••> A^. 

Examples 

(i) Roots all Distinct. Suppose 

/(A) = [ 44A 2 + 108 - 8A 2 0 

-8A 2 35A 2 -f 54 12A 2 

0 12 A 2 28A 2 + 36 

Then A(A) = 16 x 27 x 81(A 2 + 1) (A 2 -f 2) (A 2 -h 3), so that the roots of 
A(A) = 0, say A x = +i, A 2 = +i*j2, A 3 = + i^/3,andA 4 = ~i, A 6 = -^2, 
A 6 = — i^/3, are all distinct. 



294 


LINEAR CONSERVATIVE SYSTEMS 


9-9 


The adjoint matrices corresponding to A x , A 2 , A 8 are 
F( Aj) = 8{1, - 8, - 12} [1, - 8, -12], 
.F(A 2 ) = -16{4,-5,6}[4,-5,6], 
J’(A 3 ) = 72{4, 4,-3} [4, 4,-3], 
so that we may choose 

* 0 =r 1 * 4 

-8 -5 4 

-12 6 -3 


(ii) Repeated Roots. An illustration is provided by the system 
already considered in the example to § 6-6, for which 


/(A) = T32A 2 + 48 -16A 2 

— 16A 2 32A 2 + 48 


11A 2 + 12 


A (A) = 3 x 48 2 (A 2 +l) 2 (A 2 + 4). 


If the roots of A(A) = 0 are denoted by A : = A 2 = + i, A 3 = + 2 i, 
A 4 — A 6 — i y A # —— 2i, then 

(i) 

F(X 2 ) = 0 with jF(A 2 ) = 2 x 48ir 5-4iri 0 4], 


while ^(A 3 ) = 3 x 48 x 16{1, 1, - 1} [1, 1, -1]. 

Hence we may choose k 0 = [" 5—4 IT. 

-4 5 1 

4 4-1 


(iii) Matrix E Singular. Suppose 

/(A) = pA 2 +l, 2, 3 

2, 12A 2 + 4, 6 

3, 6, 27A 2 + 9_ 

Then A(A) = 12 x 8lA 4 (A 2 +1) and 

i’(A) = 3A 2 p08A 2 + 72, -18, -12]. 

-18, 27A 2 +18, -6 

-12, -6, 12A 2 +8. 
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For the quadruple zero root Aj = A g = A a = A 4 = 0, the three matrices 

„ _ (1) (3) 

F(A 4 ), F( A 4 ) and F(A 4 ) are null, but 


( 8 ) 


^(A 4 ) = 6 

72 

-18 

-12' 

= 12 

' 3 

6' 

re -3 o] 


-18 

18 

-6 


-3 

3 

L3 0 -lJ 


.— 12 

-6 

8. 


1 

-4. 



Hence the constituent motions appropriate to the four zero roots may 
be taken as (a+bt) {3, -3, 1} and (c+dt){ 6, 3, —4}. 

9*10. The Direct Solution for a Linear Conservative System 
and the Normal Coordinates. From the discussion of §9-9 it 
follows that if the roots of A(A) = 0 are denoted by ± A x , ± A 2 , ± A in , 

the complete motion of the system is represented by 

q(t) = k 0 M 0 (t)a + k 0 M 0 (~t)b y .(1) 

where a and b are two columns each containing m arbitrary constants 
of integration, k 0 is the matrix of the m modal columns appropriate to 
the roots X l9 A 2 ,..., A^, and M^{t) is the diagonal matrix 

0 ... 0 1 . 

0 eW ... 0 


L 0 0 ... 

If the values of the generalised coordinates and velocities at t = 0 are 
assigned, and if A 0 denotes the diagonal matrix of the roots A x , A a , ..., 
A m , the direct solution is easily verified to be 

9(0 = + ^o( “ 0] + Po[A*o(0 ~ Af 0 ( - 0] A^&o^O). 

The corresponding special form of the solution has already been 
obtained in equation (6*6*4). 

If new generalised coordinates Q are adopted given by 

q = k 0 Q> 

the general solution (1) reduces to 

Q = M 0 (t)a+M 0 (-t)b, .(2) 

while the equations Aq + Eq = 0 transform to (see example to § 9*3) 

( [k 0 AkQ ) Q + (& 0 *®&o) Q “ 0* .(*0 

From (2) it is seen that when the generalised coordinates are chosen 
in this way a disturbance of the system restricted to any one of its 
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coordinates Q r produces motion in that particular coordinate only; 
in other words, the motions in the several degrees of freedom are un¬ 
coupled. The coordinates Q are usually termed the normal or principal 
coordinates. 

Evidently, if d 0 denotes an arbitrary non-singular matrix, the 


system of differential equations 

d 0 (Q-A*Q) = 0 (4) 

will be satisfied by (2). Choosing 

d 0 — k 0 Ak 0i (5) 

we then have also, by (3) and (4), 

-d 0 Al = k' 0 Ek 0 . (6) 


Now k' 0 Ak 0 and kgEk 0 are both symmetrical; hence by (5) and (6), 
d i5 — d j{ and — d tj Af = —d^ Af, where d^ denotes the typical element 
of d 0 . Accordingly, if the roots of A(A) = 0 are all distinct, 
dy = dfr — 0, and d 0 will therefore be a diagonal matrix. The normal 
coordinates, which can clearly be taken in arbitrary multiples, may 
in this case be chosen to be 

Q = d^k^q, 

or, on application of (5), Q = k' 0 Aq. .(7) 


When A(A) = 0 has multiple roots, the matrix d 0 need not be 
diagonal. For instance, if A t = A 2 , then d 12 = d 21 , but these elements 
do not necessarily vanish. The diagonal matrix d 0 is in this case 



More generally, if the m roots A x , A 2 ,..., A m consist of a roots equal to 
A a , b roots equal to A 6 , and so on up to p roots equal to A p , where 
a + 6 +... = m, then 

d 0 = K, 0, ..., 0 

I d b , ..., 0 


0 , 0 , • • •, dp 
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where the typical square submatrix d i is symmetrical, non-singular, 
and of order i. It is easy to see that the normal coordinates can be 
defined in accordance with the formula Q — k' 0 Aq even when multiple 


roots occur. For 


& 0 1 ? = M(t)a + M( — t)b, 


and since d 0 is obviously permutable with M(t), it follows that 


= dH(t)d 0 a + M{ — t)d 0 b. 


Hence Q = M(t)a + — 


where a and /? are columns of arbitrary constants. The coordinates Q, 
defined in accordance with the formula, are thus normal. 


Examples 

(i) System with Roots all Distinct. Suppose the given equations to be 
' 44 -8 OTj+riOS 0 01? = 0. 

-8 35 12 0 54 0 

0 12 28j 0 0 36 

Then, if the modal columns are chosen as in example (i) of § 9-9, 
k 0 =\ 1 4 41. 



Hence the normal coordinates may be taken as 

# = i'A? = fl -8 -121 [ 44 -8 Ol? 

4 -5 6 -8 35 12 
4 4 — 3 J 0 12 28_ 

= ("108 -432 -4321?. 

216 -135 108 

144 72 -36 

As a verification of (5) and (6) 

k' 0 Ak 0 = T108 -432 —4321 T 1 4 4' 

216 -135 108 -8 -5 4 

144 72 -36j[-12 6 -3 

= [8748 0 0 ]=d 0 . 


0 2187 0 

0 0 972 
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Similarly it is easily shown that 


KEk 0 = 

"8748 

0 

0 ■ 


0 

4374 

0 


0 

0 

2916_ 


-AX- 


(ii) System with Multiple Roots. If the equations are 
' 32 -16 4]$+r48 0 0"Ig = 0, 

-16 32 4 0 48 0 

4 4 llj [o 0 12 . 

the roots of A(A) = 0 are A x = A 2 = + *, A 8 = +2*, A 4 = A 8 
A« = — 2*. As in example (ii) of § 9-9 

k a = f 6-4 -11, 


— and 


6 -1 
4 1 


and the normal coordinates may be taken as 
Q = k' 0 Aq = f 6-4 4][ 32 


6 -4 
-4 6 

-1 -1 


32 -16 
16 32 


240 -192 48] q. 
-192 240 48 

-12-12 3 


In the present case 

k 0 Ak 0 — 


while 


KEk 0 - 


= r 2160 -1728 0 ] = d { 

-1728 2160 0 

0 0 27 

2160 -1728 0 1 =-AJ4 

-1728 2160 0 

0 0 108 


(iii) The Matrix E~ X A. Since k^M^t) is a solution of Aq+Eq = 0, 
it follows that Ak(iA i 4_ Eko = 0 

When E is non-singular this may be written 

o = — Aj 8 , 

so that the canonical form of E~ l A is always diagonal, even when 
repeated roots occur. Hence E~ X A has the modal matrix k 0 and the 
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latent roots — Af, — A|, — AThis result will be used in Chapter x 

in connection with the solution by iterative methods of the dynamical 
equations in the form E~ x Aq + q = 0. The matrix E~ X A is there re¬ 
ferred to as the dynamical matrix . 


9*11. Orthogonal Properties of the Modal Columns and 
Rayleigh’s Principle for Conservative Systems. If A, and A a are 

any two different roots of A(A) = 0, then, since k r e A ** and k 8 e are 
solutions of/(D) q = (AD 2 + E) q = 0, it follows that 


f(K)K = 0, (1) 

/(A.) 1.-0. (2) 

Hence also k 8 f( A r )i; r = 0, .(3) 

KfiK)K = 0. (4) 

But, since in the case of a conservative system /(A r ) and /(A a ) are 
symmetrical, (3) may be transposed to give 

Kf(K)ks = 0. (5) 

On subtraction of (4) from (5) we have the result 

(K — ^DKAkg — 0 . 

Hence if A,.# ± A a , k' r Ak 8 = 0 , . (6) 

so that also k' r Ek 8 = 0. .(7) 

Equations (6) and (7) express the orthogonal properties of the modal 
columns.* 

On premultiplying equation (1) by k' T we have 

A *k' r Ak r ~-k' r Ek r . (8) 


Equation (8) can be regarded as defining A r as a function of the ele¬ 
ments k ir of the modal column k r . On differentiation of (8) with respect 
to each of the elements k ir in succession we derive the set of equations 


\\^KAk r = -A \Ak r -Ek r = -/( \)k r = 0. 
Since k' r Ak r is an essentially positive function, it follows that 



(9) 


The value of A, as defined by equation (8) is accordingly stationary for 
small variations of the modal columns from their true values. This 


* See § 92 of Ref. 29. 
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constitutes Rayleigh’s principle,* and by its aid very accurate values 
of frequencies of oscillatory constituents can be obtained from rather 
rough approximations to the corresponding modal column. If in (8) 
we regard k r as defining an approximate mode, the frequency n T will 
be given by the formula 

4 n*n z r = k' r Ek T jk' r Ak T . .(10) 


Examples 

(i) Orthogonal Properties of the Modal Columns. Referring to the 
system considered in example (ii) of § 9-9, we have 

A = f 32 -16 41: E = f48 0 0l. 


32 

-16 

II 

‘48 

0 

O' 

16 

32 

4 

0 

48 

0 

4 

4 

llj 

_0 

0 

12 


The two modal columns appropriate to the double roots ± i are here 
k x = {6, —4, 4} and k 2 = {—4, 5, 4}, while the column appropriate to 
the simple roots + 2* is k s = (1,1, — 1}. 

Applying equations (6) and (7) we have 

k' 3 Ak 2 = [1, 1, — 1] f 32 -16 4 ir- 4 l =[12, 12,- 3 ]r- 4 ] =0, 


-16 32 4 

|_ 4 4 11 

Jfcg Ek t = 12[1, 1, — 1] f4 0 01 f 


41 = 12[4, 4,—l]f —41=0 


i:: iJl i :j 

Similarly, V z Ak x = [12, 12, -3] {5, -4, 4} = 0, 
iY z Ek x = 12[4, 4, - 1]{5, -4, 4} = 0. 

On the other hand 

K A h= [ — 4,5,4] 1“ 32 -16 4] f“ 5] = [-4,5,4] T 2401 + 0. 

-16 32 4 -4 -192 

4 4 llj L 4J L 48 . 

(ii) A Generalisation of Rayleigh's Principle for Dissipative Systems 
of Special Type . In the case of dissipative systems the equations of 
free motion have the form 

f(D) q = (AD* + BD + G) q = 0. 

* For a fuller discussion of Rayleigh’s principle see Ref. 32. 


2401 + 0. 
-192 
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If A,, is a simple root of A(A) = 0, and if k r is the corresponding modal 
column, then /(A,) k r = 0. On premultiplication of this equation by 
K the result is X % T k' T Ak r + X T k' T Bk r + KCk r = 0. 

As for equation (8) we may now differentiate with respect to the 
elements k ir of k r in succession. When A, B, C are symmetrical the 
equations so obtained can be written 

(|^J (2A XAk r +KBk r ) = — 2/(A r ) k T = 0. 

Hence in this special case jj^j = 0. 

(iii) Relations between Modal Columns for Dissipative Systems. If 
k r , k a denote two modal columns distinct from k t , then by appropriate 
premultiplication of the equation /(A*) k t = 0 we have 

tfk' r Ak t + X t k' r Bk t + k' r Ck t = 0, .(11) 

)$kgAk t + X t kg Bk ( + k' t Ck t = 0. .(12) 

Elimination of A, from (11) and (12) yields a relation between the modal 
columns k r , k s , k t . Two similar relations can be obtained by interchange. 
If r and t are interchanged in (11), then 

W l Ak r + \ r k' t Bk r + k' t Ck r = 0, .(13) 

and in the special case where A, B, C are all symmetrical the coeffi¬ 
cients in (11) and (13) will be identical. Thus A, and A; will be the roots 
of the quadratic equation 

A%Ak,+Xk' r Bk,+k' r Ck ( = 0. .(14) 

As a numerical illustration suppose 

/(A) = p 0JA 2 +|T 2JA+|T Oj, 

with A(A) = (A 2 +1) (2A + 3) (A +1) 

and E(A) = rA 2 + 2A + 3, -2A ]. 

|_ — 2A, 2A 2 + A +1J 

If A x = -1, A, = —f, A 3 = +i, A 4 = -i, then 

= (1, 1}, k 2 = {3, 4}, k s — {1 + i, — *}, k t = (1 —i, *}• 
Hence, using k x and k 2 , we have 

k[Ak a = 10, k[Bk t = 25, k’ x Ck 2 = 15. 

Equation (14) is accordingly 10A 2 + 25A+15 = 0, the roots of which 
are A = — 1 and A = — f. 
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9*12. Forced Oscillations of Aerodynamical Systems. The 

general theory of the forced oscillations of aerodynamical systems is 
rather complicated, and only a brief discussion can be attempted 
here. In recent years the subject has assumed importance in relation 
to such questions as the mass-balance of control surfaces of aeroplanes 
and the prediction of the critical speeds for wing flutter.! 

A dynamical system can be forced either by the application of 
known small periodic forces or by the imposition of known small 
periodic movements at one or more given points of the system. The 
method of construction of the equations of motion will be sufficiently 
illustrated if we assume that a simply sinusoidal force or movement is 
imposed at a single point Q and in the direction of the fixed reference 
axis Ox v Let the Cartesian coordinates of Q, referred to the fixed 
axes, be x 1} x 2 , x 3 at any instant of the motion, and be x%, x \, x% 
when the system is in its mean position. Then if the generalised co¬ 
ordinates are measured from the mean position, the first geometrical 
equation (see §8-9) yields, to the first order of small quantities, a 
relation of the type 

*i-*i = hVi+Vi qt+-+v m q m = v'v> 


where the coefficients are constants dependent on the position of 
Q in the system, and tj, q denote column matrices. 

Firstly, assume that a known force sin cot acts at Q in the direction 
Ox v and that no constraints are applied at Q in the directions Ox 2 
and Ox 3 . Then the corresponding set of generalised forces is 

P = F^ sin cot, 
and the equations of motion are therefore 

f(D) q = (AD 2 + 13D + C) q = p sin cot, .(1) 

where p = Ftf. 

Next suppose that Q is given a known motion d x sin cot in the 
direction Ox v and that as before no constraints are imposed in the 
directions Ox 2 and Ox 3 . Then the generalised forces corresponding to 
the unknown reaction X x at Q are given by P = X x 7j, while on account 
of the imposed motion at Q the generalised coordinates will be con- 
nected by the condition ^ = diaino)L . (2) 

In this case the differential equations of motion are represented by 
(AD 2 + BD + C)q = X 1 r/. .(3) 


t See, foi example. Refs. 33, 34, 35. The discussion given in this section is based on 
Ref. 36. 
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On elimination of the unknown reaction X 1 we obtain m — I differential 
equations connecting the m generalised coordinates 9 . These, taken in 
conjunction with ( 2 ), suffice to determine the motion. The elimination 
can be effected conveniently as follows. Write 

*=[ 1. -VJVv -ValV v •••> VmlVi~\ • 

0 , 1 , 0 , 

0 , 0 , 1 , 


L 0, 0, o, 1 J 

Then (2) is readily seen to be contained in the relation 

Substituting this expression for q in (3), and premultiplying the 
equation throughout by u', we obtain 

= {Xtfi.0 . 0 }. 

.(4) 

The last m-1 of the scalar differential equations in (4) are in¬ 
dependent of X v and they involve all the generalised coordinates with 
the exception of q v They can evidently be represented by a matrix 
equation similar in form to (1), except that in general there are present 
forcing terms in quadrature with the imposed sinusoidal displacement 
of Q. 

Prom the preceding discussion it will be clear that both of the cases 
considered will be covered if we assume the equations of motion to be 

of the type (AD* + BD+C)q = pairuot + pcos<irt, .(5) 

where p and p are columns of assigned constants. 

The complete motion which follows any given initial conditions may 
be regarded as the superposition of a free motion and of a forced 
motion. If the real part of any root of A(A) = 0 is positive, the free 
motion will in time-completely mask the forced motion. On the 
other hand, if the real parts of all the roots are negative, the forced 
motion will ultimately predominate. In the discussion which follows 
only the forced motion is considered. For simplicity the roots of 
A(A) = 0 are assumed to be all distinct. 

Let the forced motion in the coordinate q 8 be 

q a = R a sin (atf + ej, 


u\AD 2 + BD + C) u[— sin o)t , j 2 , q 3 ,..., q m 
m 


0 

0 
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and write with <f=where i denotes V — 1. Then, 

on substituting this trial solution in (5), we see that the constants <r 

must be such that x 

f(i<o)cr = p + ip, 

f(-ia))cr = p-ip. 

Provided ± uo are not roots of A(A) = 0, the 2m scalar equations con¬ 
tained in (6) give the values of cr and of uniquely. On solution of these 
equations the forced amplitude corresponding to q 8 is given by 



R 8 = mod (E 8 /j\(io))) t .(7) 

where E 8 is the determinant obtained when the 5 th column of A(ia>) 
is replaced by p + ip. To obtain the expression for the forced amplitude 
in an explicit form, suppose the expanded form of the determinant 
A( A) to be p 0 A"+p 1 A«-' + ...+p n _ l \+p n , 

where n = 2m. Then 


A(w)A(-iw) = P 0 Q. n -P 1 Q . n - 1 + ...-P n _ 1 tl + P n , 
where fi = a> 2 , and the coefficients are conveniently expressible as 
P o = Pl> 

P i ~ \.P&P 2 \ {PiiPoS Pv 
P 2 = [Po>Pz> {Pi, Pi, Po} - [Pi, Pi\ {>3, Pi}, 

and so on. In the same way, if E 8 denotes the conjugate of E 8i we can 
wnte E„ E a = c 0 Q n ~ 2 - e 1 Q»- 3 +... - c n _ 3 Q + e n _ 2 , 

where the coefficients e are real. Hence 


JR 8 = E 8 EJA(io)) A( — i(o) 

_ epQ n 2 ~e 1 Q n e n „3Q-l-e n _2 

P 0 £>-PiQ"- 1 + ...-P n _„ 1 Q + P n ' 


( 8 ) 


An alternative expression can be obtained if the roots \ v A 2 ,..., A n of 
A(A) = 0 are known. For then 

R*=E 8 E 8 IP 0 n^ + A?), 

r-1 

and on resolution into partial fractions, this gives 



n 

2 


a r 


rt> 2 + AT 


(9) 


a e o(^r) w ~ a +e 1 (A 2 ) n ~ 8 +... + e n _ 8 ( A?)+e»_ t 

r n(A|-A|) 


where 
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Since the expressions on the right of equations (8) and (9) are identical, 
and since the numerator in (8) is of degree n - 2in&>* while the denomin¬ 
ator is of degree n, it follows that 2 a r = 0. 

If the forcing frequency is regarded as a variable parameter, there 
will usually be certain values of this frequency for which the forced 
amplitude of the motion in any given coordinate q s is stationary. 
The equation which gives these frequencies can be obtained by differ- 


dR 

entiation of (8) or (9) with respect to to and use of the condition = 0. 


Hence 


n 

s 


0L r (O 


(to 2 + A 2 ) 2 


= 0 . 


This yields, apart from the trivial solutions to = 0 and to = oo, 


2 a r n (w 2 + A 2 ) 2 = 0. .(10) 

r—1 j+r 

Since Ea r = 0, equation (10) is in general of degree 4m — 3 in to 2 : only 
real positive roots to 2 of this equation yield real values of the frequency. 
The maxima and the minima of forced amplitude necessarily occur in 
alternation, and since to = oo corresponds to a minimum ( R s = 0), it 
readily follows that there can be at most 2m — 1 maxima for a system 
having m degrees of freedom. 

If two conjugate roots of A(A) = 0, say p r ± ito ri have their real 
parts fi r small, it is obvious from (9) that a pronounced peak value of 
R 8 will be obtained for some value of to close to to r . If the roots are 
actually critical, so that jLt r = 0, R 8 becomes infinitely great for to = <o r% 
This is the familiar phenomenon of resonance. For this condition the 
assumed form of solution fails, and in fact the forced motion is no 
longer simply sinusoidal but is constructed of constituents of the type 
t cos tot (see § 5* 11 (a)). The amplitude of oscillation then grows without 
limit. 


Example 


Forced Oscillations of a Model Aeroplane Wing. Consider the forced 
flexure-torsion oscillations of a cantilever wing. The laws of bending 
and twisting are assumed to be invariable for the range of airspeeds 
to be considered, and the wing is accordingly treated as semi-rigid 
(see §8-12). Fig. 9*12-1 is a diagram of the system. The generalised 
coordinates q v q 2 are chosen to be respectively the downward linear 
displacements at the points L, T of the wing tip which he on the 
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leading and trailing edges. It is supposed that the wing is given forced 
sinusoidal oscillations by means of an oscillator K connected through 
a spring of stiffness <r 0 to the point L. 

If the imposed motion at Q is tfsinotf, the dynamical equations in 
the present simple case are readily shown to have the form 


(AD 2 + BD + C) {(h, q 2 } = dcr 0 sin cjt {1, 0}, 
and the amplitudes R lt R 2 of the motions in q v q 2 are given respectively 


fey 


and 


±(RiV _ 

<ro\d) 

<r 0 \ d ) 


(- A 22 o) 2 + C 22 ) 2 + (o 2 B\ 


J 22 


( P 0 (*> 4 -P2U 2 +i> 4 ) 2 + to 2 (p x oj 2 -p 2 ) 2 

( - A 2 l 0 ) 2 + c 2l ) 2 + (i) 2 Bl x 
(Po <*> 4 ~P 2 ° ) 2 +P1) 2 + oj 2 (Pi ^ 2 -P3) 2 ' 


These formulae are applied in Ref. 36 to a light aeroplane wing of 
span 9 feet and chord 3 feet, for which the numerical data are as 
follows. The spring stiffness cr 0 is chosen to be 4 pounds per foot, and 
the terms independent of the airspeed V in the damping coefficients 
B i} represent an allowance for elastic damping. 


Dynamical Coefficients for Light Aeroplane Wing 


Coefficient 

Value x 9* 

Coefficient 

Value x 9* 

A tl 

5 443 

An 

2-907 


0-4466 F + 29 

B n 

- 0-3456 F -9 

Cn 

- 0-633 V' + 2502-7 

C,i 

-0-144 F*-614-7 

Ait 

2*907 

A u 

4-743 

Bn 

0-8076F -9 

B,t 


Cn 

0-633 F 1 - 614-7 

•1 

0-144 F* +1487-7 


The results are given in Fig. 9-12-2, which shows the influence of 
forcing frequency v^cjfen and airspeed V on R 2 . The critical speed 
for flutter of the wing works out at 67-8 feet per second, and the 
corresponding critical frequency is 3-2 cycles per second. It is seen 
that R 2 tends to become infinite for this frequency as the critical speed 
is approached. 










CHAPTER X 


ITERATIVE NUMERICAL SOLUTIONS OF 
LINEAR DYNAMICAL PROBLEMS 

10* 1. Introductory. In the present Chapter the iterative methods 
described in §§4-17 and 4-18 will be applied to obtain approximate 
solutions of problems relating to the small oscillations of dynamical 
systems. Iterative methods are particularly advantageous when the 
number of degrees of freedom is large, since the solution is obtained 
without expansion of the Lagrangian determinant. Moreover, they 
often lend themselves well to the approximate treatment of continuous 
systems such as tapered beams. 

Conservative systems are considered in Part I, and the extension 
to dissipative systems is briefly dealt with in Part H. 

Pabt I. Systems with Damping Forces Absent 

10-2. Remarks on the Underlying Theory. With a conservative 
system the Lagrangian equations of free motion have the form (see 
§9-3) Ai+Eq- 0, 

in which both the inertia matrix A and the stiffness matrix E are 
symmetrical. It will be assumed that both A and E are non-singular, 
as is usually the case. In §9*9 it has been shown that the typical 
constituent motion may be taken as 

q = (eV+ e -V)fc r , 

and that AJ and all the elements of the modal column k r are real. 


Moreover, (A** + E) k r » 0. .(1) 

For the application of the iterative method it is convenient to write 
— A 2 = z = T27“\ and to express (1) in the alternative forms 

(w r I—U)k r = 0, .(2) 

(z r I-U~')k r = 0 , . (3) 


where U = E~ l A . The matrix U will be spoken of as the dynamical 
matrix * From (2) and (3) it follows that m r is a latent root of U > and 
that z r is a latent root of ?7 -1 . 

♦-Note that this matrix is, in general, not symmetrical. 
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Equation (2) could be treated in the usual way by the direct solution 
of the characteristic equation of U. However, the iterative method is 
often preferable, more especially when the constituents are known 
to be all oscillatory and only the fundamental mode (which corresponds 
to the dominant root w r ) is required. If the mode of highest frequency 
is required, the iterative method should be applied to equation (3). The 
details of the process of solution are amply explained in the examples* 
given later, but some preliminary general remarks may be useful. 

(a) To construct the dynamical matrix V it is often advantageous 
to form the flexibility matrix O = E~ x directly rather than to derive 
it by inversion of the stiffness matrix. The displacements q due to a 
set of generalised static loads W are given by 

q = <S>W. 

Hence to find the element it is only necessary to obtain the value of 
q i when WJ = 1 and all the remaining generalised loads are zero. This 
direct method for the construction of <S> is particularly valuable in 
problems on the vibrations of beams (see §§ 10*4, 10*6). 

(b) When merely the fundamental frequency and mode are required, 
the solution is directly obtained by repeated premultiplications of an 
arbitrary column by U. If the frequencies and modes in the overtones 
are required, it is necessary to evaluate modified matrices by the 
methods of § 4*18, and to use these matrices for the iterations instead 
of U . The required row K r can be found very simply from the corre¬ 
sponding column k r when the system is conservative. For equation (2) 
can be written <bAk r = m r k r , which yields on premultiplication by A 
and transposition k' r A^A = k' r AU = w r k' r A. 

Hence k t may be chosen to bek' r A. This conclusion accords with the 
orthogonal properties of the modes discussed in §9*11. 

(c) The work of iteration will be considerably shortened if by ex¬ 
perience or intuition an initial column can be chosen which is not very 
different from the true mode. 

(d) The case of equal roots, which is somewhat unusual in dynamics, 
requires special consideration. The form of a high power of a general 
matrix u which possesses a dominant latent root of multiplicity s has 
been given in §4*15, and it is easy to see how such a root may be 

* The examples in §§ 10-3, 10-4 and 10-6 are taken from Ref. 10, and that in § 10*11 
from. Ref. 11. 
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obtained from u m x^ i where x 0 is arbitrary. However, in the case of the 
dynamical matrix U of a conservative system, there is a considerable 
simplification. It can readily be deduced from the results of § 9*9 that 
if vj 8 is a latent root of U of multiplicity s, the derived adjoint matrices 

olw 8 I—U up to F(m a ) all vanish. Hence the only term arising from 
the multiple root m 8 in Sylvester’s expansion of U m is proportional 
(8-1) m % . 
to m™F(w a ). Accordingly, the usual iterative method gives w 8 as 

though it were a simple root, and the 8 modes are proportional to the s 

( 8 - 1 ) 

linearly independent columns of F(vj 8 ). If a different arbitrary columns 
are repeatedly premultiplied by U> 8 different possible modes will, in 
general, be obtained. It may also be noted that the canonical form of 
U is diagonal (see example (iii), § 9*10). 


(e) When an accurate value of the fundamental frequency is re¬ 
quired, but no great interest attaches to the mode, an application of 
Rayleigh’s principle (see (9* 11-9)) may be found advantageous. The 
principle states that small changes in the mode do not affect the 
frequency. Hence an approximate mode, given for instance by an 
uncompleted application of the iterative method, can be used to deduce 
a relatively accurate frequency. When the equations of motion are 
in the form (1), equation (9*11*10) may be applied. On the other 


hand, if the iterative method is being 
used, a convenient modification is ob¬ 
tained by premultiplication of (2) by k' r . 
This yields 

m r = k' r Uk r lk' r k r .(4) 

An illustration is given in § 10*4. 

10*3. Example No. 1: Oscillations 
of a Triple Pendulum. To provide a 
simple example we shall first consider 
the oscillations of a triple pendulum 
under gravity in a vertical plane (see 
Fig. 10*3*1). The three coordinates q v q 2i 
q z will be taken to be the small horizontal 



Fig. 10-3*1 


displacements of the masses m v ra 2 , m 3 , respectively, from the equi¬ 


librium position. Let unit force be applied horizontally to m x \ then 


the three masses will each be displaced a distance a = + m 2 -f m 3 ), 
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so that {a, a, a } = G>{1,0,0} = {O u , <D 21 ,® 31 }. When unit force is applied 
horizontally to m ti m 1 will again be displaced a distance a, but m 2 and 
mj will each move a distance a + b, where b = l 2 lg{fn 2 + fn z ). this 

manner the matrix Q is readily found to be 


a, 

a, 

a 

a, 

a+b, 

a + b 

a. 

a + b, 

a+b + c 


<t> = 


where c == l 3 /gm s . 

If W denotes a set of horizontal forces applied to the masses, the 
displacements are given by q = 0>W. Suppose now that W represents 
the inertia forces. Then 



SB 

"a, a, a 


— m i9i 



a y a + b, a + b 


-m 2 q 2 



a, a + by a + b + c_ 


.~m 3 q 3 _ 


= - 

m^y 

m 2 a, 

m 3 a 


~9i 


m x a. 

m 2 (a + b). 

m 2 (a + b) 




m x a. 

m 2 (a+b), 

m 3 {a + b+c)_ 


-? 3 . 


The dynamical equations have here been constructed directly in the 
form q = — 17$. 

When the system is oscillating in a single mode, we may take q 
proportional to and then 

(J + AJl7)* r »0, 

or (w r I-U)Jc r = 0. .(1) 

As a numerical example suppose that m x = m 2 = m 3 = m and 
/j == 1> 2 == Z 3 == l. Then j 

U ~*9 


2 2 2 
2 5 6 

2 5 11 


If the scalar factor is absorbed into m, so that now m is taken to 
mean — 6gr/A 2 Z, equation (1) becomes 

wlc = uk y 

where u is the numerical part of U . This last equation is readily solved 
by the method of iteration. The dominant value of m and the associated 
column k 9 which correspond to the fundamental oscillation, are quickly 
obtained when an arbitrary column, say {1,1,1}, is repeatedly pre- 
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multiplied by u. Thus 

«{1,1,1} = {0,12,18} = 18{0-3,0-6,1}, 

«{0-3,0-6,1} = {4,9,15} = 15{0-26,0-6,1}, 

«{0-26,0-6,1} = 14-53(0-25688,0-58716,1}, 
and after nine steps in all, 

u{0-254885,0-584225,1} = 14-4309(0-254885,0-584225,1}. 

1 F*g 

The fundamental frequency is given by 27 r J faT’ an< * a °l ose 
approximation to its value is thus 


i r~ 

2n V 14- 




= 0-102624 


A 


2n V 14-43091 

Moreover, the corresponding modal column k x is proportional to 
{0-254885,0-584225,1}. It will be observed that at each step in the 
iteration a homologous element in the column is for convenience 
reduced to unity. 

If the overtones are required, the methods of § 4-18 may be applied. 
It is necessary in the first place to find the row K lt where tu 1 k 1 = k x u. 
Since A is here a scalar matrix, we deduce at once that (see § 10-2(6)) 

K 1 = k’ 1 = [0-254885,0-584225,1]. 

Now in any oscillation from which the fundamental is absent K x q = 0. 
Hence we may write 

0-254885g' 1 + 0-584225? 2 + ? 3 = 0, 

or =—2-29211(72—3-92334^3. 

Accordingly, when the fundamental is absent 

'0 -2-29211 -3-923341 
0 1 0 

0 0 1 

Substitution in the right-hand side of the equation wq = uq yields 
wq = vq, where 


•(2) 


V 

= 

ft 


-ft- 




*?r 


ft 


-ft. 


v = 


2 

5 

5 


2 

5 

11 



'0 

-2-29211 

-3-92334' 


0 

1 

0 


0 

0 

1 


0, -2-58422, 
0, 0-41578, 

[0, 0-41578, 


-5-84668' 

-2-84668 

3-16332 
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This matrix is, of course, identical with that given by the formula 

(4*18*3) with r = 1, namely v = u —— { u n}^v 

K u 

We now proceed by the iterative method, starting with an arbitrary 
column, say {1,1,1}. In the process of approximation it is evidently 
unnecessary to compute the leading element of any column, since 
this is always multiplied by a cipher in the succeeding step. In this 
way we find 

v{l, 1,1} = -2*4309,3*5691} = 3*5691{—, -0*68110,1}, 

v{— , -0*68110,1} = , -3*1299, 2*8701} 

= 2*8701{—, -1*09049,1}, 

and after fifteen approximations the column repeats itself, the scalar 
factor being 2*6152. A computation of the leading element then gives 

v{- 0*95670, -1*29429,1} = 2*6152{-0*95670, -1*29429,1}. 

Hence in the first overtone the mode is (-0*95670, — 1*29429,1}, and 

the frequency is ^ - 0-24107 Ji. 

To determine the second overtone we note that the condition for 
absence of the first overtone is = 0, ° r 

[-0*95670, - 1*29429, l]q = 0, 

whence q x = — l*35287g 2 + l*04526g 3 . .(3) 

Elimination of q x between (2) and (3) yields 
0 = 0*93924^ 2 -f4*96860g3, 
or* q 2 — — 5*2900g 3 . 

Hence for a motion in which the fundamental and the first overtone 
are both absent we may write 


V 

= 

- 1 , 0 , 0 


V 

<72 


0 , 0 , - 5-2900 


?2 

-? 3 - 


0, 0, 1 _ 


_? 3 . 


* This result could also have been obtained by repeated postmultiplication of an 
arbitrary row by v, which yields [0,1, 5'2899]v = 2*6152[0,1, 5*2899]: see also footnote 
to §4*18. 
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1, 

0 , 

0 " 


0 , 

0 , 

-5-2900 


0 , 

0 , 

1 
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and making this substitution on the right-hand side of the equation 
wq = vq, we obtain wq = wq, where 

w=rO, -2-58422, -5-84668' 

0, 0-41578, -2-84668 
0, 0-41678, 3-15332 
0, 0, 7-8238' 

0, 0, -5-0461 
L0, 0, 0-9539 

The iterative process here yields, in one step, the result 

w{8-2019, -6-2900, 1} = 0-9539(8-2019, -5-2900, 1}. 

The mode is thus proportional to (8-2019, —5-2900, 1} and the 

frequency is found to be 0-39916 

The second overtone could alternatively have been deduced directly 
from the inverse of the matrix u. It is readily found that 

'5-2 O') 

-2 3 —11 

0-1 lj 

mid the iterative process when applied to this matrix yields the result 
tt-*{8-20180, -5-28994,1} = 1-048323(8-20180, -6-28994,1}. 

The mode agrees well with that found previously, while the frequency 
workB out as 


it- 1 = l 


J / 6 x 1-04 
2ir*J l 


048323(7 


0-399157 


V 


10*4. Example No. 2: Torsional Oscillations of a Uniform 
Cantilever. An illustration will next be given of the use of the method 
in relation to continuous systems. As an approximation the given 
system is replaced by a finite system of rigid or semi-rigid'* units, 
suitably interconnected. In order that this finite system may reproduce 
closely the behaviour of the continuous system, the number of such 
units must usually be large. 

Suppose the given uniform cantilever to be divided into ten equal 
sections. These sections are assumed to be rigid, and to be inter¬ 
connected by torsion springs, the stiffnesses of which are so chosen that 

* For definition of “semi-rigid” systems see § 8*12. 
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the torsional stiffness for the complete finite system measured at the 
middle of each section agrees with that for the continuous beam at the 
corresponding position. The coordinates q are taken to be the angular 
displacements of the segments from the equilibrium positions. Let 8 
be the span of the beam, C the torsional stiffness per unit length and P 
the polar moment of inertia for unit length. 

The flexibility matrix is easily found to be 



0-05 

0-05 

0*05 

0*05 

0*05 

0*05 

0*05 

0*05 

005 

0-05' 

005 

015 

0*15 

0*15 

015 

0*15 

0*15 

0*15 

015 

015 

0-05 

0-15 

0*25 

0*25 

0*25 

0*25 

0*25 

0*25 

0-25 

0-25 

005 

015 

0*25 

0*35 

0-35 

0*35 

0*35 

0*35 

0-35 

0-35 

0-05 

015 

0*25 

0*35 

0*45 

0*45 

0*45 

0*45 

0-45 

0-45 

0-05 

0-15 

0*25 

0*35 

0-45 

0*55 

0*55 

0*55 

0-55 

0-55 

0-05 

0-15 

0-25 

0*35 

0*45 

0*55 

0*65 

0*65 

0-65 

0-65 

0-05 

015 

0*25 

0*35 

0*45 

0*55 

0*65 

0*75 

0-75 

0-75 

0-05 

0-15 

0*25 

0*35 

0*45 

0*55 

0*65 

0*75 

0-85 

0-85 

0-05 

0-15 

0*25 

0*35 

0-45 

0*55 

0*65 

0*75 

0-85 

0-95. 


and the inertia matrix is evidently OlsP times the unit matrix 


The dynamical matrix U is thus simply the numerical matrix u 




multiplied by the scalar 0-ls 2 P/C. Hence the iterative process, when 
applied to the matrix u , will yield the dominant value of 


m = - 10C/s 2 A 2 P. 


To illustrate the convergence, the initial mode chosen in Table 10*4*1 
differs widely from the true fundamental mode. For the purpose of 
comparison, the exact solution for the continuous beam is tabulated 
beside the solution obtained for the segmented beam. 

Rayleigh’s principle will next be applied to determine the frequency 
from an approximate mode. Assume the third column of Table 10*4*1 
to be the approximate mode k r : then the fourth column is Uk r . Hence 


k' r Uk r = (0*0752 x 0*3139) -f (0*2241 x 0-9342) + ... 4-(1*0 x 4*0100), 
and k' r k r = 0-0752 2 + 0-2241 2 +... + 1*0 2 . 

Accordingly 

w r = k' r Uk r lk' r k r = 19*8630/4*8916 = 4*0606. 

This value for m r is identical with that found in the eighth column, and 
is very accurate for the segmented beam. 
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Table 10*4* 1 

Fundamental Torsional Oscillation of a Uniform Cantilever 


Initial 

column 




Iteration number 



Exact 

solution 









for con- 









tinuous 



1 

$ 


% 


A 


beam 

mm 

0-25 


0*3139 

0*0783 



0*3196 



0*15 

0*746 

0*2241 

0*9342 

0*2330 

0*9492 


0*9508 

0*2342 

0*2342 

0*26 

1*225 

0*3684 

1*5321 

0*3821 

1*6562 

0*3837 

1*5587 

0*3839 

0*3839 

0*36 

1*68 

0*5053 

2*0932 

0*5220 

2*1249 

0*6239 

2*1281 

0*6241 

0*5241 

0*46 

2*10 

0*0316 

2*6037 

0*6493 

2*6414 

0*6612 

2*6452 

0*6514 

0*6516 

0*55 

2*475 

0*7444 

3*0511 

0*7009 


0*7626 

3*0972 

0*7627 

0*7628 

0*66 

2*795 

E52&I 

3-4240 

0*8539 

3*4685 

0*8561 

3*4729 

KTC331 

0*8563 

0*75 

3*05 

0*9173 

3*7129 

0*9259 

3*7586 

0*9267 

3*7631 

0*9267 

0*9267 

0*85 

3*23 

0*9714 

3*9100 

0*9751 

3-9561 

0*9753 

3*9600 

0*9754 

0*9754 

0*96 



4*0100 

1*0 



4*0606 



w 

3*325 

0*95 

= 3-60 

4*010 

4*056 

4*061 

4*053 


10-5, Example No. 3: Torsional Oscillations of a Multi- 
Cylinder Engine. In this example a multi-cylinder engine is supposed 
to be coupled to an airscrew of very great inertia. The rate of revolution 
of the airscrew is assumed constant and the system can therefore be 
treated as though the airscrew were fixed. The problem is now very 
similar to that dealt with in § 10-4 except that the stiffnesses and 
inertias of the segments may be variable along the shaft. 



Jx 

Jz 


A 

J S 

C\ 

C t 

C 3 

C 4 

Cs 

C 6 


Fig. 10*5*1 


Let the moments of inertia of the rotating masses, augmented by an 
allowance for the reciprocating masses,* be J v J 2 , ...,«/ n , and let the 
stiffnesses of the sections of the crankshaft be C v C 2 ,..., C n , as shown in 
Fig. 10*5‘1 for the case n = 6. The n coordinates q are again chosen to 
be the angular displacements from the equilibrium positions. Suppose 
unit torque to be applied at J x ; then q x = q 2 =... = q n = 1/<7 X =/ 1? 
say. If unit torque is applied at J 2 the displacements are q x = f x and 

* This allowance corresponds to half of the reciprocating mass supposed situated at the 
centre of the orankpin. 
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?2 = ?3 == • • • = 2w = A +A> where / 2 = 1 /C 2 . Proceeding in this way we 
find that 


<D = 


fi> 


fv 

A +A> 


A> 

A 


A 

A+A 


which gives 






J7 = 


*4A> «4A> 

*^iA» «4(A +A 


A+A+A. 

*4A> 

A(A "h A)> 


•••» A+A+--+/nJ 


*^nfl 

«4(A+A) 


LAA A(A+A)> «4(A+A+A)» •••> *4(A+A+-**+/n)J 

As a numerical example assume n = 6; J x = J 2 =... = J 6 = J; and 
<? 2 = C 3 =... = <7 e = (7, with Ci = 0-5C. Then 



2 3 4 
2 3 4 
2 3 4 
2 3 4 


2 2 

3 3 

4 4 

5 5 
5 6 
5 6 


2 ~ 

3 

4 

5 

6 
7 


J 

= say. 


Table 10-5-1 

Fundamental Torsional Oscillation of a Six-Cylinder Engine 


Initial 

Iteration number 

column 

1 

2 

3 

4 

0 

2 

0-29 

7*72 

0-389 

8-980 

0-4014 

9-1046 

0-40285 

0 

3 

0-43 

11-29 

0-568 

13-081 

0-5848 

13-2555 

0-58652 

0 

4 

0*57 

14-43 

0-727 

16-614 

0-7427 

16-8216 

0-74431 

0 

5 

0-71 

17-00 

0-866 

19-420 

0-8681 

19-6450 

0-86924 

0 

6 

0-86 

18-86 

0-950 

21-370 

0-9553 

21-6003 

0-95575 

1 

7 

10 

19-86 

10 

22-370 

1-0 

22-6003 

1-0 


Iteration number 

5 

6 

7 

9-11734 

0-40301 

9-11870 

0-40303 

9*11884 

0-40303 

13-27316 

0-58671 

13-27504 

0-58673 

13-27523 

0-58673 

16-84246 

0-74448 

16-84467 

0-74450 

16-84489 

0-74450 

19-66745 

0-86935 

19-66982 

0-86936 

19-67005 

0-86936 

21-62320 

0-95580 

21-62562 

0-95580 

21-62585 

0-95580 

22-62320 

1-0 

22-62562 

1-0 

22-62585 

1-0 







318 


OSCILLATIONS OF A CRANKSHAFT 


10-5-10*6 


Hence if m = —- 75-7 the equation to be solved is mq — uq. The com- 
ArJ 

put&tions in the iterative process, with { 0 , 0 , 0 , 0 , 0 , 1 } chosen as the 
initial column, are summarised in Table 10-5*1. 

It will be seen that seven iterations are sufficient to determine the 
mode to five significant figures, and the value of w is then 22-626. 

In a treatment of the same problem by Carter* an equation equi¬ 
valent to the characteristic equation of u is given with certain terms 
kept general. With the data chosen for the present example his 
equation can be written 

TZ 7 6 — 27m 6 + 105T27 4 — 140tzt 3 + 81tz7 2 — 21ti7 + 2 = 0, 
and it can be verified that the dominant root is the value of m just 
computed by the iterative process. 

10*6. Example No. 4: Flexural Oscillations of a Tapered 
Beam. We shall next find the fundamental mode and frequency of 
flexural oscillation of a cantilever beam which closely resembles an 
untwisted airscrew blade. The beam chosen is such that the differential 
equation governing the flexural oscillations has a known exact solu¬ 
tion, which will be compared later with the solution given by the 
iterative method. The symbols to be used are as follows: 

A = sectional area at a current point, 

B = flexural rigidity at a current point, 
o* = constant density of the material, 

8 = span of the beam, 

7)8 = distance of a current point from the root, 
q = amplitude of oscillation, 

A 0 , B 0 = values of A, B at root. 

The beam is specified by the equations 

A = A 0 (l-ij), 

(l-i/) S! (184 + 258i/ + 222i/ !! +76i/ 3 -76i/ 4 ) 

184(1 + 151/) 

The displacement in the true fundamental mode is proportional to 

6// 2 +28i/ 3 — 15i/ 4 , 

and the corresponding frequency is given by 


* Ref. 37. 
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For the iterative treatment the beam is supposed to be divided into 
ten segments of equal length. Each segment is replaced by a particle 
of mass equal to that of the segment and situated near the appropriate 
centroid, the position being chosen to provide round numbers for the 
fraction of the span, namely 0 05 , 0 * 15 , 0 - 25 , 0 - 35 , 0 - 45 , 0 - 55 , 0 - 65 , 0 - 75 , 
0 * 84 , 0 * 93 . The particles are assumed to be interconnected by springB 
in such a manner that their displacements under a system of static 
loads are identical with the displacements of the continuous beam at 
the corresponding positions under the same conditions of loading. 
Since the flexibility matrix is symmetrical, it is sufficient to consider 
the case where the load is external to the point at which the displace¬ 
ment is measured. Let unit load be applied at a point L distant sy L 
from the root; then the bending moment at a current point P distant 
87 j from the root (where ij < rj L ) is s(tj l — rj). The curvature at P is there¬ 
fore — The bending of an element of length sdy at P causes 
a rotation of the part of the beam external to P of amount s 2 {rj L — y)drjjB , 
and the resulting displacement of a point D distant 8rj D from the root 
is 8*(rj L — 7]) (7j D — 7j)drjlB. The total displacement at D due to the unit 
load at L is thus r Vj) 

2= I s3 (Vl ~ V) (Vd ~ V) dqlB, 

where rf<rj D < rj L . This integral is computed with the aid of Simpson’s 
rule to give the flexibility coefficients. 

The mass of any segment is given at once by jcrAsdi? taken over the 

segment, and the inertia matrix consists simply of the diagonal matrix 
of the masses. The dynamical matrix is finally given by 


U = 

where u is the numerical matrix 


8*<rA Q 

Wb 0 u ’ 


*00048 

00170 

0-0278 

0-0338 

0-0374 

0-0378 

0-0350 

0-0290 

0-0105 

0-0072- 

0 0200 

01539 

0-2895 

0-3848 

0-4383 

0-4504 

04224 

0-3530 

0-2396 

0-0890 

00351 

0-3281 

0-8018 

1-1895 

1-4256 

1-5089 

1-4399 

1-2190 

0-8340 

0-3122 

0 0404 

0-5032 

1-3725 

2-3550 

3-0453 

3-3530 

3-2778 

2-8105 

1-9500 

0-7361 

0 0646 

0-6774 

1-9440 

3-5990 

6-1304 

5-9868 

60473 

5-3138 

3-7247 

1-4204 

00798 

0-8500 

2*5143 

4-8431 

7-3172 

9-1962 

9-7464 

8-8148 

6-2893 

2-4299 

0 0050 

1-0260 

3-0855 

6*0872 

9-5029 

12-5311 

14 1421 

13-3540 

9-7689 

3-8417 

01102 

1-2002 

3-6670 

7-3307 

11-6902 

15-8666 

18-6956 

18-7320 

14-2533 

57557 

01235 

1-3674 

4-1700 

8-4500 

13-6570 

18-8680 

22-7941 

23-7555 

19-0581 

8-0196 

.01368 

1-5130 

4-6838 

9-5700 

15-6250 

21-8691 

26-8919 

28-7782 

24-0587 

10-8837. 

If m = 

— lO z Bj8 A <rA 0 A 2 , the equation for solution is wq 

= uq> 

in which 


q here represents the column of the displacements of the ten particles, 
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The solution by iteration is shown in Table 10-6-1, where the 
arbitrary initial mode is chosen to be parabolic. Since the amplitudes 
near the tip are liable to rather larger errors than elsewhere for highly 
tapered beams, the amplitude at 0-75 of the span is chosen as the 
standard of reference, and is reduced to unity at each step. The funda¬ 
mental mode and the corresponding value of vs are determined in five 
steps, and compare well with the exact solution for the continuous 
beam. 

Table 10-6-1 

Fundamental Flexural Oscillation of a Tapered Beam 


V 

Initial 

column 

Iteration number 

1 

2 

0*05 

015 

0-25 

0*35 

0-45 

0*55 

0*65 

0-75 

0*84 

0*93 

0*004 

0*040 

0-lli 

0*217 

0*36p 

0*537 

0*75i 

1*0000 

1*2544 

1-5376 

0*1358 

1*5916 

5*5129 

12*4446 

22*8059 

36*7092 

54*0074 

74*2962 

94*5355 

115*9423 

0 00183 
0*02143 
0*07420 
0*16749 
0*30696 
0*49410 
0*72692 
100000 
1*27243 
1*56055 

0*1288 

1-5502 

5*2631 

11*9308 

21*9603 

35*5004 

52*4388 

72*3938 

92*3589 

113*5088 

0*00177 

0*02142 

0*07271 

0*16481 

0*30325 

0*49037 

0*72436 

100000 

1*27578 

1*56794 

ZT 

74*296 

72*394 


Iteration number 

Exact 
solution 
for con¬ 
tinuous 
beam 

3 

4 

5 

0*1284 

1*5458 

5*2493 

11*9025 

21*9144 

35*4370 

62*3618 

72*3097 

92*2751 

113*4305 

0*00177 

0*02138* 

0*07260 

0*16460 

0*30307 

0*49007 

0*72413 

1-00000 

1*27610 

1*56868 

0*1284 

1*5455 

5*2483 

11*9004 

21*9110 

35*4325 

52*3565 

72*3043 

92*2702 

113*4266 

0*00177 

0*02138 

0*07259 

0*16459 

0*30304 

0*49005 

0*72412 

1-00000 

1*27615 

1*56875 

0*1284 

1*5455 

5*2482 

11*9003 

21*9109 

35*4324 

52*3564 

72*3044 

92*2705 

113*4272 

000177 

0*02138 

0*07259 

0*16459 

0*30304 

0*49004 

0*72411 

100000 

1*27614 

1*56875 

0*00177 

0*02126 

0*07221 

0*16381 

0*30182 

0*48863 

0*72279 

100000 

1*27964 

1*57935 

72*310 

72*304 




10*7. Example No. 5: Symmetrical Vibrations of an 
Annular Membrane. The transverse vibrations of an annular 
membrane have already been discussed in example (iii) of § 7-10 and in 
§ 7-15. The same problem will now be solved by the iterative method. 
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If the annulus is bounded by the radii r v r 2 , and if unit load is dis¬ 
tributed evenly round a circle of radius r, then for the small 
displacement q of a circle of radius p is readily shown to be 


_ log (?g/r) log (p/?i) 
q 2nT log (rJ r i) ’ 

where T is the tension per unit length of the membrane. In the 
problem previously considered r x = 1 , r 2 = 2 : hence 


log (2/r) logp 
^ 27rI 7 log2 




( 1 ) 


Suppose the membrane to be divided into ten annuli of equal width, 
having mean radii 1-05, 1*15, 1*25, etc. Let each annulus be replaced 
by a massive circular ring at the mean radius of the annulus, and 
let these rings be interconnected by springs in such a manner that 
the displacements under a static load system equal the displace¬ 
ments of the continuous membrane under the same load system. 
The flexibility matrix is then determinable from the equation ( 1 ): 
the case p ^ r need only be considered since the matrix is symmetrical. 
The inertia matrix is again the diagonal matrix of the masses and 
is easily found. The final equation is wp = uq , in which u x 10 3 is 
the matrix 


“ 6*226 

5*856 

5*406 

4*883 

4*291 

3*636 

2921 

2*150 

1*327 

0*454 - 

6*347 

16*776 

15*487 

13-987 

12*292 

10*415 

8*367 

6*160 

3*802 

1*301 

4*641 

14*248 

24*727 

22*332 

19 625 

16*628 

13*359 

9*835 

6*070 

2-078 

3*798 

11*915 

20*678 

30*034 

26*394 

22*363 

17*967 

13*227 

8*164 

2*794 

3*107 

9*748 

16*918 

24*574 

32*679 

27*688 

22*245 

16*377 

10*108 

3*460 

2*463 

7*727 

13*410 

19*477 

25*902 

32*658 

26*237 

19*316 

11*922 

4*081 

1*859 

5*832 

10*121 

14*700 

19*548 

24*647 

29*980 

22*072 

13*623 

4*663 

1*290 

4*048 

7*025 

10*204 

13*569 

17*108 

20*810 

24*665 

15*223 

5*211 

0*753 

2*363 

4*102 

5*957 

7*922 

9*989 

12*150 

14*400 

16*735 

5*728 

_ 0*246 

0*768 

1*332 

1*935 

2*573 

3*244 

3*946 

4*677 

5*435 

6*219 . 



where m denotes the mass per unit area of the membrane. 

In Table 10-7-1 the initial arbitrary column chosen is parabolic, and 
yields zero displacement at the edges of the membrane. The funda¬ 
mental mode is proportional to the last column in the Table, and the 
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corresponding value of or is 0-13514. To provide a comparison with the 
results of §§ 7*10 and 7-15, make the substitution 


10 logio 2 
w log e 10' 

Then the value of K works out as 9*674, in comparison with the 
figure 9*763 obtained from tables of Bessel functions. 


Table 10*7*1 

Fundamental Transverse Vibration of an Annular Membrane 


Initial 

Iteration number 

column 

1 

2 

3 


xl0- s 


xlO-* 


x 10~ 3 


■ > In 

25-86 

0-184 

25-24 

0-186 

26-203 

0-1862 

m 3j9| 

71-69 

0-511 

70-02 

0-515 

69-875 

0-5164 

m ?s| 


0-764 

104-66 

0-769 

104-362 

0-7712 

m 


0-929 

127-01 

0-934 

126-523 


m 

■ 

1-000 


1000 

135-320 



136-95 

0-976 

131-85 

0-969 

130-887 

0-9672 

B 

121-12 

0-863 

115-54 

0-849 

114-464 

0-8458 


94-55 

0-674 

89-24 

0-656 

88-213 

0-6519 


59-79 

0-426 

55-83 




019 


0-144 

18-76 

0-138 

18-494 

0-1367 | 




Iteration number 

* 


4 

5 

6 

xl0- a 


x 10 -8 


x 10~ 8 


25-200 

0-1864 

26-202 

0-1865 

25-203 

0-1865 

69-864 

0-5168 

69-867 

0-5170 

69-869 

0-5170 

104-326 

0-7718 

104-326 

0-7719 

104-326 

0-7720 

126-437 

0-9364 

126-425 

0-9365 

126-423 

0-9355 

135-174 

1-0000 

135-147 

1 0000 

135-140 

1-0000 

130-685 

0-9668 

130-644 

0-9667 

130-633 

0-9667 

114-223 

0-8450 

114-173 

0-8448 

114-161 

0-8448 

87-996 

0-6510 

87-947 

0-6508 

87-935 

0-6507 

54-937 

0-4064 

54-903 

0-4062 

54-893 

0-4062 

18-438 

0-1364 

18-426 

0-1363 

18-423 

0-1363 
_1 


10*8. Example No. 6: A System with Two Equal Frequencies. 

Consider the system of light rigid rods shown in Fig. 10*8*1. The rods 
AB , CD, EF , each of unit length, swing about the axis AE under the 
constraint of springs of stiffnesses respectively, and are also 

jointed to the rigid rod BF. The lengths A C , CE , BD, and DF are all 
equal. Lastly, the rod DO, which is of unit length, swings about the 
axis BF under the constraint of a spring between CD and DO of 
stiffness Masses 4, 4, 1 are carried at B, F, O, respectively. 
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Take as generalised coordinates q lt q 2 , q 2 the linear displacements 
of B, F, 0 , respectively. Then the displacement of D is i(?x+? 2 ). In 
a general static displacement of the system, the elastic moments at 
A, C,E,D, will be \q v \{q l + q 2 ), \q 2 , \(q z -q±- q 2 ), and since the lever 
arms are all of unit length, the vertical forces at B, D, F, 0 are also 
4?i. §(?i + 9'*)> tea. i(?s — ?i _ ?a)- To find the flexibility matrix, apply 
unit load at B, F, G in succession. When unit load is applied at B, we 
have by moments about AE, 

1 = tei + § (?i+?a) + tea = ?i+?a> 
while by moments about AB, 

tea + f (?i + ?a) = 0, or q t + 2q 2 = 0. 



Hence q 1 = 2, q 2 = — 1, and since the moment at D is zero, 
?3 = Si + tfa = 1- The displacements are thus {2, — 1, 1}. Similarly, 
when unit load is applied at F, the displacements are { — 1,2,1}. When 
unit load is applied at O, we have by moments about AE, 

2 = tei + f (?i + q 2 ) + tea = ?i + ?a> 

and, since the displacement is symmetrical, q 2 = q 2 = 1. Moreover, by 
moments about BE, 

1 = i(?3-?i-9 , a) or q 3 = 11. 

Hence in this case the displacements are {1,1,11 }. The flexibility 
matrix is thus 


The inertia matrix is evidently 

A = T4 0 O' . 

0 4 0 

0 0 1 . 
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Hence the dynamical matrix is 


U = <t>A = 


8 

— 4 
4 


— 4 
8 
4 


n. 

i 

li 


If the initial column for the iterative solution of mq — Uq is chosen to 
be {0,0,1}, the results are as tabulated below. 


Initial 

Iteration number 

column 

1 

2 

3 

0 

1 

0091 

1*364 

0116 

1*464 

0*123 

0 

1 

0091 

1*364 

0*116 

1*464 

0*123 

1 

11 

10 

11*728 

10 

11*928 

1*0 


Iteration number 

4 

5 

6 

1*492 

0*124 

1*496 

0*125 

1*500 

0*125 

1*492 

0*124 

1*496 

0*125 

1*500 

0*126 

11*984 

1*0 

11*992 

1*0 

12000 

1*0 


The corresponding mode is thus {0-125,0-125,1-0}, while m = 12. 
On the other hand, when {0,1,0} is adopted as the initial column, 
the results are as follows: 


Initial 

Iteration number 

column 


2 

3 

4 

0 

-4 




-12*6 


-12*144 

-1*0 

1 

8 



1*4 

16*2 

1*286 

16*288 

1*259 

0 

4 




12*6 


12*144 

1*0 


Iteration number 

5 

6 

7 

-12*036 

16*072 

12*036 

■■ 

-12*008 
15*016 
12008 

-1*0 

1*260 

1*0 


-1*0 

1*260 

1*0 


Hence again m = 12, but the mode is now {-1-0,1-250,1-0}. The 
system therefore has two equal frequencies given by m = 12, and the 
two corresponding modes may be arbitrary linear combinations of 
{0-125,0-125,1-0} and {—1-0,1-250,1*0}. Subtraction of the second of 
these columns from the first yields {M25, -1-125, 0}, so that the 
modes may conveniently be taken as {1, -1, 0} and {1,1, 8}. The first 
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represents a torsional motion about the undeflected axis CDO, while 
the other is a pure flexural motion having the same frequency. From 
physical considerations it is now clear that the third mode is purely 
flexural and has a different frequency. This mode and frequency can 
readily be found from the conditions for absence of the two funda¬ 
mentals, namely (see § 10-2 (6)) 

J^Aqs[l,-l,0]Aq = [4, -4,0 ]q = 0, 
k' 2 Aq = [l,l,8]Aq = [*,*,8]q = 0. 

Hence q t = (fa = — {fe. or the mode in the overtone is k a = { — 1 , — 1,1}. 
The relation Uk a = m s k a now gives at once ttt 3 = 3. It can in fact be 
verified that 

U m = tsT 1 - 1 - 1 ] f 12 ”* 0 0 1 [" 1 1 2 

1 1-1 0 12 m 0 -9 90 

8 0 lj 0 3 m J L~8 -8 2 

_ i2 m ri -nr 1 1 21 3™r-n[-8 -8 2], 
~l8~ 1 1 L_9 9 oJ + 18 -1 

_8 OJ L 

and it will be observed that the canonical form of U is diagonal. 

10-9. Example No. 7: The Static Twist of an Aeroplane 
Wing under Aerodynamical Load. An illustration of the iterative 
solution of a statical problem may be useful. Suppose an aeroplane 
wing to be subjected to an aerodynamical twisting moment which 
varies with the angle of twist. It is required to determine the equi¬ 
librium configuration of the wing. 

Suppose the wing to be divided into n segments. Let a r be the 
initial mean incidence of the rth segment, measured from the position 
corresponding to zero aerodynamical moment, and let a r + 0 r be the 
incidence under load. The aerodynamical twisting moment is assumed 
to be Mr = —W r (oc r +d r ), where — W r is a positive coefficient depending 
on chord, wind speed, etc. The flexibility matrix of the wing will be 
denoted by O, and <D -1 = E. Hence if the set of aerodynamical loads 
M is applied to the wing, and if W denotes the diagonal matrix of the 
coefficients W r , the twists 6 will be given by 

6 = <t>M = -<3)W(<z+6) = g+fd, 
where g = — <t>W(x and/= —Q)W. 


( 1 ) 
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This equation can be solved at once by means of the iterations 
6(8+ 1) = g+f&(a) as described in § 4-13 (d), provided that the latent 
roots of/all have moduli less than unity. It will now be shown that if 
the equilibrium position is stable, this condition is satisfied. 

Equation (1) can be written 

(E+W)6+ Wa = 0 . 

Choose new coordinates <f> given by 

(E+ W)<f> = (E+W)6+ Wa, 

so that <j> is measured from the equilibrium position. In any displace¬ 
ment <}> the change in the potential energy is given by ffi(E+ W)<j>\ 
and if.the energy is to be a minimum in the equilibrium position, which 
is the condition for stability (see §8-11), then the quadratic form 
+ W)<f> must always be positive. In this case the arguments of 
§ 9-9 can be applied to the determinant | EX + W \. When A = 0, since 
IF is a diagonal matrix of negative quantities, the determinants A 1 of 
§9-9 are alternately positive and negative. When A = 1, they are by 
hypothesis all positive. It follows that the roots of | EX + W | =0, 
which are the latent roots of — E~ V W = —<S>W = /, are all real, positive, 
and less than unity. Hence if the equilibrium position is stable, the 
iterative process converges. 

As a simple numerical example, imagine a wing to be divided into 
three sections. Let unit torque applied at sections 1, 2, 3, respectively, 
produce corresponding angular deflections (in degrees) of {1,1,1}, 
{1,2,2}, {1,2,3}. Further, let the chord and wind speed, etc., be such 
that = —0-1, W 3 = —0-1, W 3 = — 0'05 unit of torque per degree. 
Finally let the uniform initial incidence be one degree. Then 


<PW = 

i i r 


*01 

0 

0 

= 

*01 01 

006 * 


1 2 2 


0 

01 

0 


0-1 0-2 

01 


1 

CO 

<M 

i-H 


_ 0 

0 

0 - 05 . 


01 0-2 

0-16 

9 = 

-<t>Wa = 

. ; 
’ i 

*01 

01 

0 - 05 " 

"1“ 

= 

' 0 - 25 * 

. 



0-1 

0-2 

0-1 

1 


0-4 




01 

0-2 

0-16 

1 _ 


0 - 46 . 



Choosing 0(0) = g, and calculating in succession the values of 

6(8 + 1) = g+f6(s), 
we obtain the following results: 
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i 

II 

<3 

M 0) 

0(1) = 
g+/m 

m i) 

0(2) 

/»(2) 

6(3) 

0-25 

0087 


0-120 

0-370 

0-132 

1 

0*4 

0150 


0-206 

0-606 



045 

0172 

0-622 

0-237 

0-687 


/■VETfS 


M3) 


MV 

0(5) 

MB) 

0(6) 

M0) 

0(7) 

0-136 

0-386 

0-138 



0-389 

0-139 

0-389 

0-235 

0-635 




0-639 

0-239 

0-639 

0-270 

0-720 


0-724 

0-275 


0-275 

0-725 


Hence the solution is 

0 = {0-389,0-639,0*725}. 

It may be noted that the latent roots of/are 0-373, 0-050, and 0-027. 


Part II. Systems with Damping Forces Present 

. 10-10. Preliminary Remarks. The methods of Part I can be 
extended to the case where damping or “motional” forces are present. 
The first step is to replace the Lagrangian equations by an equivalent 
system of the first order. Usually the most convenient scheme of 
reduction is that given at the end of § 9-7. If the original equations are 

(AD* + BD+C)q = 0 , 

and if y = {?, g}, the reduced equations of free motion will be 

Dy = u y> .( 1 ) 

in which u s T 0, I 1. 

l-A-W, -A-'BJ 

If y = e^k 8 denotes the typical constituent of (1), then 

(A 8 I-u)k 8 = 0, 

and this equation can be treated, as before, by the iterative method. 
The modes of motion are obtained successively in descending order of 
the moduli of the latent roots of u (i.e. of the roots A 8 ). If the modes 
are required in ascending order of the moduli, then the process should 
be applied to the equation (Aj 1 / — vr 1 ) k 8 = 0. 

Since the latent roots of u will frequently be complex, it may be 
necessary to apply the formulae given in § 4-20. If a pair of conjugate 
complex roots A x , A a = /i ± io) are dominant, then fi and <j> can b© 
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determined from the relations 

a+ * = E »+1 E s+3-K+2 

fl -t*W — rp jp jp2 > 

"s+2 "a+l 


1010-1011 


,2 _ Ea+l^a+Z’ 


e 8 e m -eu 


o„ __ E b E m -E m E^ % 

E S E 8 ^EI +1 • 

where E 8i E b+1 , etc. are consecutive values of a homologous element of 
the column or row used in the iteration, and s is sufficiently large. 

10*11« Example: The Oscillations of a Wing in an Air- 
stream. The system to be considered is a model aeroplane wing 
placed at a small angle of incidence in an airstream. The three degrees 
of freedom assumed are wing flexure, wing twist, and motion of the 
aileron relative to the wing. When the wind speed is 12 feet per second 
the matrices of the dynamical coefficients are as follows:* 


It is found that 


Hence 

u-r 


" 17-6 

0-128 

2-89 

0-128 

0-00824 

0-0413 

_ 2-89 

0-0413 

0-725 

" 7-66 

0-245 

2-10 ' 

0-0230 

0-0104 

0-0223 

0-600 

0-0756 

0-658 

'121 

1-89 

15-9 ' 

0 

0-0270 

0-0145 

. 11-9 

0-364 

15-5 


x 10-a, 


xlO- 8 . 


A - 1 = 

0-1709 

1-063 

- 0 - 7417 ' 


1-063 

176-5 

- 14-29 


.- 0-7417 

- 14-29 

5-150 . 


' 0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

1 

— 11*85 

- 0-08172 

8-764 

- 0-8885 

0-003147 

0-1054 

41*43 

- 1-573 

202-0 

- 3-628 

- 1-016 

3-235 

28*46 

- 0-08696 

- 67-82 

2-920 

- 0-05901 

- 1-512 


* Numerical data extracted from Table 24 of Ref. 30. 
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When the initial arbitrary column is chosen to be 

y( 0 ) = { 0 , 0 , 0 , 0 , 0 ,l}, 

the iterative process yields 

y( 1) = uy( 0) = {0,0,1,0-1054,3-235, -1-512}, 

y(2) = uy( 1) = {0-1054,3-235, -1-512,8-521,193-4, -66-42}, 

and so on. We now choose a homologous element from the successive 
columns, and derive (for instance) the Table given below. The method 
of tabulation is a modification of a scheme due to Aitken (see § 4-20). 


No. of 
column 
8 

Last element E t 

F.= 

E»Et+t~E)+l 

= /A* + CO* 

G,= 

E. E .. 3 — 

^ = 2 m 


1 

-67-71 


_ 

_ 


-1*612 

-46*09 x 10* 

71*23 

— 

— 


-6*642 x 10 

-32*83 x 10 4 

71*62 

— 

— 


2*176 x 10* 

-23*48 x 10 8 

71*93 

— 

— 

5 

4*294 x 10* 

-16*89 x 10 8 


— 

— 

a 

-2*318 x 10 4 

-12*16 x 10 10 


— 

— 

7 

-2*681 x 10* 

- 8*768 xl0 ia 


— 

— 

8 

2 *144 x 10* 

- 6*309 xl0“ 

72*04 


-1*769 

9 

1*662 x 10 7 

- 4*646 xl0 1# 

— 

— 

— 

10 

-1*819 x 10 8 

— i 

— 

— 

— 

11 

-0*7962 x 10 # 

— 

— 

— 

— 


Accordingly, /t 2 + w 2 = 72-04 and 2/i = — 1-769, so that the dominant 
latent roots of the matrix u are A! and its conjugate A 2 , where 

Ax = fi+ia) = —0-885+ *8-443. 

The modal column, say k 1 = £, + i-rj, corresponding to the root 
Ax, is readily found. Since k x is arbitrary to a complex scalar multi¬ 
plier, we may choose for instance the rth column in the iteration to be 
half the sum of the conjugate complexes £ ± ir/, i.e., £. Then the (r + l)th 
column will be //£— (in/, and hence £ and 7 are obtained. The first three 
elements of £+ it/ define the amplitudes and epochs of the generalised 
coordinates q, and the last three those of the generalised velocities q. 
Actually, since q = A q, the last three elements in any column are iden¬ 
tical with the first three elements in the succeeding column; hence q 
may be determined from one column only. In the present instance, 
the 10 th column is 


10*{ — 2*412, -39-00, 15-52, 26-65, 528-8, -181-9}. 
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On omission of the faotor 10® this gives 

{£i,l a ,£a} = {-2-412, -39-00, 15-52}, 
while is defined by 

-0-885{- 2-412, -39-00, 15-52} -8-443 {VvVvVa} 

= {26-65, 528-8, -181-9}. 

Henoe 

lt\ — {—2*412 — i 2*904, - 39-00-» 58-55, 16-52+ »19-92}, 
or, since a scalar multiplier may be extracted, 

Jbi = {1-0, 18-53 + »1-961, - 6-686-» 0-2085}. 

This value can be verified by substitution in the equation 
Af^ll: 1 +A 1 J5j5: 1 + Ck^ = 0. 

The remaining modes of motion can be found as usual by the use of 
the condition for absence of the dominant mode. If R m and 22 m+1 are 
two successive rows obtained by repeated postmultiplication of an 
arbitrary row by u, then (see § 4-18) the conditions for absence of the 
dominant mode are R m y = 0 and R m+1 y = 0. In the present example, 
if [0,0,0,0,0,1] is used as an initial row, then the 9th and 10th rows 
found by the iterative process may be written 

8-273x 10 7 [1, -0-000781,-1-910, -0-004369, -0-001575, 0-1877], 

44-09x 10 T [1, -0-002531, -2-466, 0-2924, -0-001927, -0-4126], 

If these rows are used in the above conditions, and y x and y 4 (=y x ) 
are then expressed in terms of the remaining coordinates, we find 

y 1 = 0-000807y a + l-918y a + 0-001680y 5 — 0- 1789y # , 

y t = 0-005896y a + l-840y 3 + 0-001186y 6 + 2-023y e . 

We may therefore write 


~y{ 

= 

'o, 

0 - 000807 , 

1 - 918 , 0 , 

0 - 001580 , 

- 0 - 1789 - 



Vi 


0 , 

1 , 

0 , 

0 , 

0 , 

0 


Vt 

Vi 


0, 

0 , 

1 , 

0 , 

0 , 

0 


Vi 

Vt 


0 , 

0 - 005896 , 

1 - 840 , 0 , 

0 - 001186 , 

2-023 


y « 

Vs 


0 , 

0 , 

o, 

0 , 

1 , 

0 


y& 

-y«. 


Lo, 

0 , 

0 , 

0 , 

0 , 

1 
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Introducing this condition on the right-hand side of the equation 
A y = uy , we obtain a modified equation A y = vy, where 


'0, 

0-005896, 

1-840, 0, 

0-001186, 

2-023 ' 

0, 

0, 

O 

O 

1, 

0 

0, 

• o, 

O 

O 

0, 

1 

0, 

-009652, 

-15-60, 0, 

-0-01663, 

0-4279 

0, 

-1-561, 

274-8, 0, 

-0-9548, 

-11-52 

.0, 

-0-04678, 

-7-861, 0, 

-0-01058, 

-0-6963. 


This matrix possesses properties similar to those of u except that 
zero latent roots have been substituted for the original dominant roots 
fi±io)= — 0-885 ± i 8-443. Hence the iterative process, when applied 
to v , would yield the subdominant latent root or roots of u and the 
associated mode of motion. 



CHAPTER XI 


DYNAMICAL SYSTEMS WITH SOLID FRICTION 

11*1. Introduction, (a) General Range of the Theory. The theory* 
to be given relates solely to dynamical systems which, apart from the 
presence of solid friction, obey linear laws. The term solid friction is 
interpreted to mean a resistance between two bodies, due to tangential 
surface actions, which follows the simple idealised law that its magni¬ 
tude has a constant value so long as relative motion between the bodies 
occuts. Whenever the forces tending to produce relative motion fall 
short of the foregoing value, the resistance adjusts itself to balance 
these forces, and no relative motion results. These simple laws, which 
are adopted as a mathematical convenience, take no account of the 
difference between static and dynamic friction, and thus only approxi¬ 
mately represent true conditions. 

The influence of friction on the oscillatory behaviour of a dynamical 
system is often somewhat unexpected. For instance, the introduction 
of friction may cause an otherwise stable system to develop maintained 
oscillations. 

(6) Experimental Illustration of the Influence of Friction . Some 
wind tunnel experiments! carried out at the National Physical 
Laboratory in connection with tail flutter of an aeroplane illustrate 
some of the effects of solid friction. The model used had a rigid front 
fuselage but a flexible rear fuselage and tail unit, and it was suspended 
by a sling of wires from a turn-table locked to the roof girders of the 
wind tunnel. Occasionally the locking bolts of the turn-table worked 
slack under the tunnel vibration, with the result that the whole model 
became capable of angular displacement in yaw, but only under con¬ 
siderable frictional constraint. The degrees of freedom of the dynamical 
system, when tested under such conditions, can (for simplicity) here be 
taken as angular displacement of the rudder, lateral bending of the 
rear fuselage, and yawing of the whole model resisted by a considerable 
frictional moment. At moderate airspeeds the motion of the model was 
of a peculiar spasmodic type, which gave the impression that increas¬ 
ing and decreasing oscillations were occurring in succession. 

* Chaps, ix and xn are based on investigations originally described in Refs. 38 and 39. 

t See pp. 233 and 260 of Ref. 40. 
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The explanation is probably as follows. At the wind speeds which 
gave rise to the spasmodic motion, the binary system composed of the 
rudder and the fuselage was unstable, whereas the addition of the 
freedom in yaw rendered the system stable. A very small initial im¬ 
pulse was incurred, and this was sufficient to start growing rudder- 
fuselage oscillations. It was, however, too small to overcome the 
friction opposing yawing movement, so that meanwhile the model 
remained gripped in yaw. However, ultimately, a yawing moment 
due to the growing oscillations developed, which was sufficiently large 
to overcome the friction. All three degrees of freedom then cooperated 
in the motion, but since the system was then stable, the yawing moment 
eventually decreased and the model again became effectively gripped. 
The cycle of changes then repeated. Briefly, the phenomenon is 
attributable to the successive “sticking” and “unsticking” of a 
frictionally constrained member of the system. 

A feature of importance in the illustration is the fact that the system 
would have been definitely stable had the friction been absent. Clearly 
then the state of stability of a system when friction is completely 
removed is no sure guide to the behaviour of that system when friction 
is present. 

(c) Ankylosis . The effect just described as the “sticking” of a 
degree of freedom is known as ankylosis * Whenever the system is 
moving without its full complement of degrees of freedom the motion 
is said to be ankylotic , and the particular generalised coordinates which 
are arrested are the ankylosed coordinates. 

(d) Types of Oscillation. When solid friction is operative the 
oscillations which occur in a given degree of freedom can be of three 
different general types, namely: 

(i) Decaying oscillations , in which the coordinate concerned either 
tends to a constant value or becomes permanently ankylosed. 

(ii) Unbounded oscillations, in which the coordinate tends to attain 
indefinitely great values. 

(iii) Bounded oscillations , which are defined to be any oscillations 
other than those of types (i) and (ii). 

(e) Linear and Non-linear Systems. When friction is taken into 
account we have to deal with forces whose functional dependence on 

* The term * ankylosis’ is used by Poincar4, Jeans and others to denote loss of one or 
more degrees of freedom in a dynamical system. 
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the generalised velocities is not linear. Strictly speaking, therefore, 
the problem must be classed (though as a very special case) under the 
general subject of non-linear dynamics. In the case of a linear system 
the values of the generalised coordinates and velocities at any instant 
of motion are all changed proportionally when the initial values are 
changed proportionally. In other words, so far as concerns similar 
disturbances, the motion may be said to be proportional to the magni¬ 
tude of the initial disturbance. With a non-linear system the motion 
will depend in a very much more complicated way on the initial 
disturbance. 

(/) Effects of Very Large Disturbances. If the system is linear in 
all respects apart from the friction, two general propositions are self- 
evident: 

(i) No disturbance, however large, can produce unbounded oscilla¬ 
tions when friction is present if these cannot occur when friction is 
absent. For even if increasing oscillations are possible, yet the growth 
necessarily ceases when the movements become so large that the 
frictional forces are insignificant in. comparison with the other applied 
forces. 

(ii) A sufficiently large disturbance (in general) produces unbounded 
oscillations when friction is present if such oscillations are possible 
with the frictionless system. 

(g) Effects of Arbitrary Disturbances. The ultimate type of motion 
which results from a disturbance of arbitrary magnitude will depend 
upon the number of sets of bounded oscillations which are possible and 
upon the question as to whether these particular motions are stable 
when slightly disturbed. Suppose, for instance, that a very large dis¬ 
turbance gives rise to unbounded oscillations and that a small similar 
disturbance produces decaying oscillations leading to ankylosis and 
ultimately to complete rest of the system. Then it may be inferred that 
for one magnitude at least of the disturbance the oscillations must be 
bounded. If these are the only possible bounded oscillations, they 
must obviously be unstable, and they will therefore not be realisable in 
practice. However, more generally there may be an odd number of sets 
of bounded oscillations, alternately unstable and stable, the largest 
and smallest being unstable. In these cases, for a certain range of the 
initial disturbance, the resulting motions will tend to one or other of 
the stable sets of oscillations. 
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11*2* The Dynamical Equations. The m generalised coor¬ 
dinates q are supposed to be measured from the mean configuration, 
and the system is taken to be linear except for the friction. The 
dynamical equations are then of the type 

f(D)q = (AD* + BD + C)q = p 9 .(1) 

where p denotes the generalised frictional forces. When the friction is 
applied in a general manner the precise specification of p is com¬ 
plicated, and as a simplification attention will be restricted to systems 
in which each degree of freedom has its own independent frictional 
constraint. Thus, it is assumed that the typical generalised frictional 
force Pi has a constant magnitude R t when <^4=0, and that the sign is 
such that the force always opposes the motion in g < : hence p t = — if 
q t > 0, and pi = + 2? i if q t < 0. On the other hand, if = 0, the value 
of pi lies between the limits ± R iy and is determined by the other 
circumstances of the motion. 

The motion in any time interval may be such that continuous 
movements occur in all the degrees of freedom, or it may be such that 
throughout the interval one or more of the generalised coordinates are 
arrested by friction. In the first case the motion will be described as 
complete in the interval, and in the second case as ankylotic* If a gene¬ 
ralised velocity happens to vanish in the interval without becoming 
stationary, the motion will still be classed as complete. On the other 
hand, if the velocity which vanishes is also stationary, the motion at 
that stage will be viewed as ankylotic. 

For brevity, any stage of the motion throughout which fa is con¬ 
tinuously positive, and never actually zero, will be described as an 
upstroke in q t : similarly, if g* is continuously negative and never zero, 
the motion is a down-stroke in g*. Hence p t = — 2? i or + R t according 
as the stroke in q t is upwards or downwards. The two instants at which 
a given stroke begins and ends will be referred to as the terminal 
instants for that stroke: when a distinction between the two terminal 
instants is necessary the one which relates to the beginning of the 
stroke will be called the starting instant and the other the stopping 
instant . 

Throughout any time interval in which no generalised velocity 
vanishes the values of the forces p are constant both in sign and 


* Note that the definition of ankylotic motion would require generalisation if the 
frictional constraints in the several degrees of freedom were not independent. 
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magnitude. The solution of the dynamical equations for such an 
interval will now be given. For simplicity it is assumed in the analysis 
that the n (= 2m) roots A 1; A 2 ,..., A n of A(A) = 0 are all distinct. 


Write C~ l p = 6, .(2) 

and let 0 denote the column formed from the m constants 6 and from 
m oiphers, so that in the partitioned form 

© = {0,0}. . (3) 

Also put y(t) = {g(0, ?(<)}. .(4) 


Then if M(t), k, A and l are matrices as defined in §§ 6* 10 and 6*4, and 
if r denotes some datum instant in the interval, the direct matrix 
solution is readily verified to be 

y(t) — & = lM(t— t) i _1 {y( T ) - ©}. 

Alternatively, if n reducing variables a (t) (see example (iii) of § 6*4) 


and n quantities /? are introduced such that 

a(0 = l-'yit), (6) 

/? = i- 1 ©, (6) 

the solution is expressible as 

<*(*)-P = (7) 


Matrices formed from particular columns or rows of the (m,») 
modal matrix k will often be used in the sequel. The matrix {& lr } 
formed from the rth column of k (namely, the rth modal column) will 
as hitherto be written in the abbreviated form k r . Similarly the matrix 
\k pi \ formed from the jpth row of k (namely, thepth modal row*) will be 
denoted for brevity by l p (see also remarks in § 1*2 (c)). 

11*3. Various Identities. Certain identities, which will be of use 
later, will now be obtained. 

(a) The Reducing Variables a(t). The relation (11*2*6) gives im¬ 
mediately by definition of the matrix l 

{3(0, 3(0} = {MA}a(«), 


so that q(t) = ka(t), .(1) 

q(t) — kAa(t). .(2) 


* Note that whereas the rth modal column k r is naturally associated with the rth root 
A,, of A(A) = 0, the pth modal row is associated with the pth. generalised coordinate q 9 . The 
modal rows must not be confused with the rows of the matrix k defined by (3*6*8). 
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Again, differentiation of (2) yields q(t) = kAdc(t) y while differentiation 
of (11-2-7) leads to 

dt(t) = AM(t-T){oc(r)-fi} = A{a («)—>?>. 

Accordingly, q(t) = &A 2 {a(0--yff}. .(3) 

When the expressions for q, q and q given by (1), (2) and (3) are sub¬ 
stituted in (11-2-1), the result is 

(AkA 2 + BkA + Ck) oc(t) — AkA 2 /3 = p . 

But on account of the properties of the modal columns 
AkA 2 + BkA + Ck = 0. 


Hence, writing as A -1 , we have 

kA 2 J3 = —ap. .(4) 

Equation (3) may be therefore expressed as 

q(t) = kA 2 a(t) + ap. .(5) 


It is to be noted that since the discriminants of the kinetic energy 
are positive the principal diagonal elements of the matrix a are all 
positive. 

The relations (1), (2) and (5) give the displacements, velocities and 
accelerations in terms of the reducing variables. 

It .will be seen later that the reducing variables and the quantities 
P usually occur multiplied by a modal constant. Formulae to aid the 
calculation of these products can be derived by use of the identity 
(6-5-8), which for the present second-order equations reduces to 

kM(t - r) l- 1 = S [A, A + B, A] eW-r). 

r ‘ 1 A(A,) 

Since this is true for all values of t , the matrix coefficients of e^~ r) on 
the right and on the left can be identified. Hence 

kE'l-^^lArA + B.A], .( 0 ) 

A(A,) 

in which E r is the null square matrix of order n with a unit substituted 
in the rth principal diagonal place. On postmultiplication of both sides 
of (6) by y(t) and application of (11-2*5) we readily obtain the identity 

a T k T = ^ [A r A + B, A] { q(t), q(t)}> 

A(A,) 


(7) 








338 VARIOUS IDENTITIES 11*3 

where k T denotes the rth modal column. This can be written alter¬ 
natively as jpn \ 

«rK = -gf- [- O,\A\ {?(<), j(<)}.(8) 

K^K) 

(6) The Quantities ft. From equations (11*2-3) and (11*2*6) it follows 

that {d, 0} = {&, kA}{}, so that ... ... 

kp = "» .(9) 

kAfi = 0 . .( 10 ) 

Again, on postmultiplication of both sides of (6) by 0 and use of the 
relation C6 = p, we readily find 


/?,*, = - 


f(K)P 
a) • 
A r A(A r ) 


.( 11 ) 


An alternative to (11) can be deduced from the relation 
A(A) =/ u (A)F u (A) +... +f u WF u (\) +...+/ lm (A) F lm (A). 

The only term on the right of this equation which involves the stiffness 
coefficient G u is evidently f u . Hence on total differentiation with 
respect to Gu we have 

d(X. A ( A) “ A(A) 9C^ + a^7 A(A) = A(A) 0C^ + ' Fls(A) ' 


■'Is 


If in the variation considered A is chosen to correspond to the root A,., 

so that A(A r ) = 0 and ^~A(A r ) = 0, we obtain 
dC u 

(1) 3 A- 

FM = -A(A,)^. 

We may accordingly write 


is 


( 1 ) 


F(A r ) = -A(A r )JV(A r ), 


where 


N(K)= 


raA, 

0A, 

0A, 

dC n > 


sc ml 

3A, 

0A, 

0A, 

dC 12 ’ 

0C 22 ’ 


0A, 

0A, 

0A, 

\-SC lm ’ 


3CLn 

mnv 


and on substitution in (11) this gives the identity 

KPrK = N(K)p- 


.(12) 
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By performing the differentiations with respect to the damping 
coefficients By or the inertial coefficients Ay instead of the stiffness 
coefficients, we can derive the relations 


SAy 1 0Ay 1 

dCy A,. dBy Aj! dAy 


The application of the preceding formulae to systems having only 
a single frictionally constrained coordinate will now be considered. 


11*4. Complete Motion when only One Coordinate is 
Frictionally Constrained. The frictionally constrained coordinate 
will be assumed to be q m . In this case all the frictional forces p i are 
zero with the exception of p m , and p m —±R m according to the sense 
of the stroke. If <j> denotes the column of m constants 0* such that 

C<j> = {0, 1}, 

then clearly 6 = — R m <f> for an up-stroke and 6 = + R m <f> for a down- 
stroke. Further, if O is defined similarly to 0 as the column of m values 
<}> i followed by m ciphers, and if y is such that y = Z _1 0, then ft = — R m y 
for an up-stroke and/? = +/? OT y for a down-stroke. Equations (11-3-11) 
and (11-3*12) may in this case be expressed as 

KyrK = (^’ ^.&) = . FmmiK)} ' 

a ^) .(i) 

while (U-3-9), (11-3*10), and (11-3-4) yield respectively 


ky = <j>, ( 2 ) 

kAy = 0, (3) 

MV = ~{a im }. (4) 


Let the initial disturbance, assumed imposed at t = r, be given by 
the values y(r) or a(r), and denote as Z 0 , t v etc. the successive terminal 
instants at which—while the complete motion is in progress—the 
velocity q m vanishes. If q m { J ) + 0, the sign of this velocity will fix the 
sign of p m in the first stage of the motion. For the sake of definiteness 
we shall assume that q m (r) > 0, so that the first motion in q m is upwards 
and p m = -R m (see Fig. 11-4-1). 

On application of (11-2*7) we have for the solution in the first time 
interval a(<) + RmJ = M{t _ t) { a(r) + R m y) .(5) 
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Hence the displacements, velocities, and accelerations at any time 
are given by 

q(t) = ka(t) = kM{t-r){a(T)+B m y}-B m </>, 
q(t) = kAa(t) = kAM (t— r) {a(r) + R m y), 

q(t) = fcA a {a(f)+ = kA i M{t~T){a{T)+R m y) . (6) 

In particular the velocity q m vanishes when 

i m AM(t-T){a(T)+R m y} = 0, 

where t m denotes the mth modal row. The stopping instant will be 
that root t = t 0 of this equation which exceeds, and lies nearest to, 
the value r. 



To determine the general nature of the next stage of the motion, 
assume firstly that ankylosis does not occur, and that the motion in 
q m continues with a down-stroke in accordance with Fig. 11*4*1. On 
this hypothesis the solution in the interval (t 0 , < x ) is given by 


= M (t-h)Hh)- R my}- .(V 

This yields for the starting acceleration q m (t 0 ) the value 

I m A 2 {a(< 0 ) —i? m y}, 
which, on application of (4), can be written 

* m A 2 a(<o)+a mm .R m . .(8) 


If this expression is negative, the motion in q m obviously continues 
Tvith a down-stroke as assumed. 

Alternatively, suppose that an up-stroke in q m ensues at time t 0 . 
The solution in this case would be given by an equation similar to (5), 
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but with r replaced by t Q . The necessarily positive starting acceleration 
for the up-stroke in q m would then be 

*mA*{a(<o) + R m y}- 

But on application of equation (0) this expression is seen to be equal 
to the stopping acceleration for the previous partial up-stroke in q m , 
which is negative. Hence the possibility of the stroke being upwards 
is excluded. We may thus conclude that if the expression (8) is negative 
the motion remains complete and continues with a down-stroke in q mf 
but that otherwise the coordinate q m remains arrested. 

If ankylosis does not occur, the motion proceeds in accordance 
with (7) until q m again becomes stationary. This occurs when 

AJU (t — t Q ) {oc(t 0 ) — R m y } = 0, 

and the corresponding stopping instant t x is that root of this equation 
which exceeds, and lies nearest to, the value t 0 . The reducing variables 
then have the values given by 

«(<i) - R m y = M (h~ to) {a(«o) - R m y}> 


and the condition that a second up-stroke in q m is realised is readily 


shown to be 




A continuation of this process formally determines the whole motion 
until ankylosis occurs. The sequences of relations which provide the 
step-by-step solution will now be summarised. The formulae are suited 
to the case where q m {T) > 0, so that the first motion in q m is upwards.* 


(A) Successive Stages of Motion. After the first partial up-stroke 
in q m the down-strokes and up-strokes in q m occur in alternation. The 
motion in the typical down-stroke in q m commencing at time t 2r is 

a(t) — R m y — M(t — t 2r ) {^(^r) “ m7}> .(®) 

while that in the typical up-stroke in q m commencing at time t 2r+1 is 
a(t) + R m y = M(t-t 2r+1 ) + R m y) .(10) 

(B) Recurrence Relations for the Reducing Variables. These may be 
written conveniently as 

M( - < 0 ) (a(< 0 ) + R m y) = M(-t) («(t) + R m yj, 

M ( ^i) {ot(^i) — R m y) — M( — 1 0 ) {a(« 0 ) — 

M(-t a ) (a(< 2 ) + R m y} = M(-tj) {a(^) + R m y), 

* The formulae can be adapted at once to the case where q m (r) = 0. 
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and generally for 01 

.(ID 

Direct addition of these relations gives 

(~l)»a(t t ) + R m y = (-1 )* (< s - r) (a(r) + R m y} + 2I» R m y, 

.( 12 ) 


where I>) = M(t 8 -1^) - M(t e - t a _ 2 ) + ... + (-1) 4 " 1 M(t s -1 0 ). 

(C) The Terminal Equations . These fix the terminal instants / 0 , t v 
etc. at which the successive stationary values of q m occur. The first 
equation of the sequence is explicitly 


n 


S 

r-l 


A r fc mr e A A(a r (r) e~ A ’- T + R m y r e~ x ' r ) = 0, 


while for s > 1 the typical equation is q m (t s ) = 0, or 


i m Aa(t s ) = 0. 


(13) 


On substitution for a(t 3 ) from (12) this gives explicitly 


2 - 1) 8 oc r (T) e-V+ R m y r E(8, A,)) = 0.(14) 

r—1 

where 


E(s, A,.) = 2 e”Vs-i — -f ... 4 - 2( — 1 ) 8_1 4. (— 1 ) 8 e _A ^. 

The terminal instant £ s must be that root of the sth terminal equation 
which exceeds, and lies nearest to, the value t 8 _ v It may be noted that 
if the roots of A (A) = 0 are complex then each terminal equation 
necessarily has an infinite number of roots.* 


(D) Criterion for Ankylosis . Ankylosis will occur (with the co¬ 
ordinate q m arrested) at the first stopping instant t s for which 

(-l) 8 fmA 2 a(0 + ®mmtfm>°- .(15) 

This criterion is valid for s^ 0. 


Example 

System with a Single Degree of Freedom .f With m = 1 the equation 
for complete motion is 

leading to a single constant <f> x = 1/C n . The two roots A are assumed 

* If t, is large the least damped exponential will predominate in equation (14). It readily 
follows that this equation necessarily has an infinite number of large real roots, 
t Some aspects of this problem are discussed in Kef. 41. 
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to be complex, so that, say, = fi + ico and A 2 = fi — ico. The modal 

matrix may be taken as [1,1], and we may accordingly write 


1 = 


C 1 1 

IA A a 


and 1 = f — A, 


r-A a 11 . 

A, -lj 


% 

Hence fa, a 2 } = 1-%, &} = — {A^ - q v -A l9l + gj, 


2o) 1 

{rx»r 2 } = ^ = 2cjc n ^ 2 ’ — Ai ^‘ 

If the initial disturbance is assumed to be imposed at the starting 
instant t Q of a down-stroke, then 

{ a l(^o)» a 2^o)} = {^2> 


and the motion in the first down-stroke is given by 

Further, the equation for the stopping instant t x is explicitly 

^i( a i(*o) - r i7i) + A 2 (a 2 (y - R l72 ) e ( ^ A * = 0. 

This reduces to 

- \ (?i(<o) -^r) ( M * + «> 2 ) e ' dh ~ 4 * >sin "(<1 -*o) = 0. 

of which the lowest root is t x — t Q = njo). Since this time interval is 
independent of the starting displacement the time for any 

stroke is clearly T — 7t/g>. 

The formula (12), with the simplifications r = t 0 and t 8 — t 0 = sT , 
may now be applied to determine the displacement at the end of the 
ath stroke. In the present case 

M(sT) = [e^ T 0 1 = (- l) 9 efi* T I, 

[ 0 

T{s) = M ( T) - M{2T) + ... + (-1 y- 1 M(sT) 



and equation (12) therefore reduces to 

(-1)* W<,)> a 2 (*,)} = ^(?i(*o) e/ “ T “ ^ ~ 1 ) coth ^r)( A «> “ A i)’ 
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we obtain finally 

?.(«-<-D^ooth(g) - (- ir( ?1 («-goothC)e»»/-. 


The condition that the motion can actually begin is (see (15) with 
** " to) A!«,(«„) + A|a a (( 0 )+a u fij < 0, 


which reduces to 


?A>)>gT' 


If this inequality is satisfied, and if /i is positive, it is evident from (16) 
that the motion will grow to an indefinitely large amplitude. 

It should be noted that with systems having more than a single 
degree of freedom, growing oscillations strictly analogous to the 
simple type (16) cannot occur. With such systems, oscillations having 
a constant semi-period are also (in general) necessarily steadily main¬ 
tained (see example to § 11*6). 


11*5. Illustrative Treatment for Ankylotic Motion. For the 

sake of simplicity attention will again be restricted to the case where 
only a single coordinate q m has frictional constraint. The analysis is 
thus supplementary to that of § 11*4. 

Suppose, then, that at some particular stopping instant t a ankylosis 
occurs. Since the coordinate q m now remains arrested, the dynamical 
equations for the special case under consideration become 

fll(D) 9l + • • • ) 9m—1 = — @1 

... .( 1 ) 

/m-l,l(*®)?l "b • • • 9m -1 = ^m-l,m?m(^a)>. 

while the value of the frictional force p m at any stage of the ankylotic 
motion is given by 

f wlC®) 9l +...+/ m,m- lC® ) 9m—1 = Pm* 

To render the previous analysis immediately applicable to equations 
(1), without the introduction of new symbols, the convention will 
now be made that any symbol originally used in relation to the m 
degrees of freedom q v q v ..., q m will, if shown in clarendon type, have 
a similar significance in relation to the m— 1 degrees of freedom 
?i> ?a» • • • > 9m- i* Thus y(t) denotes the column of the m — 1 displacements 
9v->9m-i an d m—l velocities q v q 2 , fa —2 quantities in 

all); while M(t) and I are square matrices of order n — 2 involving the 
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fundamental constants of the system (1) and the n — 2 roots X x ,X 2 , 
X n _ 2 of the appropriate determinantal equation. 

The solution of equations (1) then is 

S m - 0 = lM(t - 1 8 ) l~'{y(t a ) - 0}. .(2) 


The column matrix 0 has n — 2 elements, of which the last m— 1 are 
ciphers and the first m — 1 are the constants 8 defined by the relations 

®U®1 + *•* + Q.m-l ®m-l = m?m(0> 

The motion will continue in m — 1 degrees of freedom in accordance 
with (2) until p m > R m or < — R m . 

11*6. Steady Oscillations when only One Coordinate is 
Frictionally Constrained. As a relatively simple application of the 
theory the conditions will now be examined under which the oscilla¬ 
tions of a system having only a single frictionally constrained co¬ 
ordinate q m are steady . Oscillations will be said to be steady when they 
are of the special type in which the displacements and velocities in all 
the degrees of freedom at the stopping instant of any stroke in q m are 
exact reversals of the corresponding displacements and velocities at 
the starting instant of the stroke. 

Suppose that a typical down-stroke in q m commences at time t = t Q 
and ends at time t — t v Then the solution during that stroke is given by 

= M(t-t 0 ){a(t 0 )-R m y}, .(1) 

and the displacements and velocities at time t — t x are exact reversals 
of those at time t = t 0 provided a(£ x ) = — a(£ 0 ), so that 
- a(t 0 )-R m y = M(T) {a(< 0 ) - R m y), 

where T — t t — i 0 . 

The last equation may be written 

[M(T) +1] a(t 0 ) = R m [M(T) -1] y, 

whence a r (t 0 ) = R m y r ^^-j .(2) 

for all values r = 1,2,..., n. 

The condition that q m shall be stationary at time f x is f m Aa(^) = 0. 
Since by hypothesis a r (t x ) = — a r (t 0 ), and a r (i 0 ) is given by (2), this 
condition requires that 

» oTXf _ 1 

R m£K k mr'Yr e T\ r+ j = °* 
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This equation fixes the semi-period T, and the appropriate initial 
conditions at time t — t 0 are then given by (2). On substitution for 
a r (< 0 ) from (2) in (1), the solution during the typical down-stroke in 
q m is explicitly 

9,(t)IK= 2 K* r m m = 2 KVrftt-t oA)» 

r-1 r-1 

m= 2 Kk„a r (t)iR m = s KKYrf(t-t<»K)> 

r-1 r-1 

where 8= 1,2, and 

2e<Mo> A »- 

f(t ~ $0> ^r) 35 1* 

The acceleration g m ($) at any instant in the down-stroke is readily 
seen to be given by 

= S W m rVrf(t-to>K)+amm- 

r-1 

The conditions for steady oscillation will now be summarised. 

(A) The semi-periodic time must be a real positive root T of the 

equation n pTX r _ i 

R m n(T)zsR m £ = 0. 

(B) The necessary initial disturbance, which is assumed to occur 
at the starting instant t = t Q of a down-stroke in g m , is given by 

n e TX r -1 

qMI R m = 2 *«-rr: 


.(3) 


r-1 

n 


■(*) 


gTA r __ l 

== ^srYr J 9 J 

where d = 1,2,...,m, and i? m is to be taken definitely with the positive 
sign. 

(C) The initial displacement g m ($ 0 ) must be positive* and the initial 
acceleration q m (t 0 ) negative: thus 

ft (>T\r _ J, 

9m(to)l R m = S *W7r e rA, + 1 > 0 


and 


w gjTAji ___ j 

?m(^o)A®m == *mr7r ^ 


.( 5 ) 

.( 6 ) 


* If q m (t 0 ) were negative and $ m (t 0 ) also negative, the first stroke would still be down¬ 
wards (as postulated), but the displacement q m at the end of that stroke would then neces¬ 
sarily be negative, and not the reverse of q m (t 0 ). 
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(D) The velocity q m (t) must not vanish within the interval (t 0 , t 0 +T). 
Hence the equation n . 


S KK 

r-1 


e TX r -f- l 


9- 


must have no positive real root (other than t — 1 0 = 0) less than the 
selected root t — t 0 = T of (3). 


Example 

Oscillations with a Constant Period are (in general) also Steady . If 
T is the given constant period, the successive terminal instants for 
the strokes in q m may be taken as t 0 = 0, t x = I 7 , t 8 = sT . The 

recurrence relations for a may be written 

<*(T)-R m y = M(T){a(0)-R m y}, 

a{2T) + R m y = M(T){oc(T) + R m y}, 

a(3T)-R m y = M(T){oc(2T)-R m y}, 

and so on. Addition of the equations in successive pairs gives 

cc(T) + oc(2T) = M(T){oc(0) + a(T)}, 

a(2T) + a(3T) = M(T){oc(T) + a(2T)} = i/(2T){a(0) + a(T)}, 

and generally _ 

a (sT) + a(s + 1 T) = M(sT) (a(0) + a( jP)}. 

Premultiplication of the last equation by I m A, and use of the con¬ 
ditions q m (sT) = q m (s+ 1 T) = 0 yields the relation 

0= %e 8TA rA ri .(8) 

r-1 

where ^4 r = A r ^ mr (a r (0)-f-a r (J T )). Taking (8) for any n consecutive 
values of s> we derive a set of n equations, which will be compatible 
either if a(0) + a(!F) = 0 (in which case the motion is steady and T is 
a root of (3)), or if 

1 1 ... 1 = 0 . 
qT\i ... e TX n 


| g(n—1)TA X e (n-l)TA a _ e (n-l)TA n 

This last equation would require T = 7t/g>, where ji ± io) are any two 
conjugate complex roots of A(A) = 0, say A. x and A 2 ; and equations (8) 
would then be satisfied if in addition 

A x + A 2 = 0 and A z = A a =... = A n = 0. 
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THE SEMI-PERIOD EQUATION 

It can be shown without difficulty that the particular value T =* njco 
would in this case still have to be a root of (3), and unless m = 1 this 
would demand a special relationship between the dynamical coeffi¬ 
cients of the system. Hence, if the periodic time is constant, and if the 
system has more than a single degree of freedom, the motion is in 
general also steady. 

11*7. Discussion of the Conditions for Steady Oscillations. 

The nature of the roots of the “semi-period equation” (11*6*3) will be 
considered first. Denote by £ r -f- iy r the complex value of \k mr y r 
corresponding to a complex root = /i T + io) r . Then 

^(T)m S (£ r + iVr)F(Kn .(1) 

r= 1 

h Ft A ~ 1 __ cos o)T — 1 + ie* lT sin 0 )T 

W 6re ~ c (/t+w>)T + i e^* T cos g)T + 1 -fie^sinwT 7 ’ 

(i) Case of Two Purely Imaginary Roots of A(A) = 0. Suppose now 
that two, and not more than two, of the roots A are purely imaginary, 
so that for this pair /i = 0. Then for the one root (say -f icj), 

F(icoT) = (cosojT— 1 + i sin a)T)l (cob toT +1 + isina)T) . ( 2 ) 

If o)T = (2s +1) 7T — e, where s is an integer and e is small, equation (2) 
gives to first order F(io)T) = 2i/e. The two terms of (1) corresponding 
to the two roots ± i(o combine at this value of T into — 4 y/e. Hence 
unless y — 0 the function tl(T)lrj changes from — oo to + oo on passage 
of T through each of the values (2s + 1 ) tt/co* It follows that when two 
(and not more than two) of the roots A are purely imaginary and the 
corresponding value of rj is not zero, the semi-period equation has an 
infinite number of real roots T separated by the values njo), Sn/o), 
57t/cl>, etc. A diagrammatic representation of the graph of f 2(T) for the 
case considered, with y positive, is curve No. 1 of Fig. 11*7*1: the 
possibility of any odd number of roots T between successive asymp¬ 
totes is not excluded. If y is negative, the sense of approach to the 
asymptotes is the reverse of that shown in the diagram. 

On application of the identity (11*4*4) it is readily shown that the 
slope of the graph f i(T) at T = 0 has always the negative value 
— a mm /2. It follows that when y < 0, an odd number of real roots T 
exists between the origin and the first asymptote. 

* For brevity, the symbol T is here temporarily used in two senses; it signifies a root 
of the semi-period equation, and also the current variable associated with the function 
Cl(T). 
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(ii) Case of Two Conjugate Roots of A(A) = 0 with Real Parts Small . 
Next suppose the real part of one pair of roots fi±io) to be numerically 
very small but not actually zero. Then for the root /i + icj the value of 
F(\T) corresponding to o)T = (2s+ l)7r-e is approximately 

F(XT) = 2^(28 +l)^+i e y ({2s +l)*^+e*y .(3) 

so that the terms in (1) corresponding to the two roots fi ± iu> contribute 
{Z+i V )F(\T) + (£-i V )F(\T) 

- 4 (< 2 * +1 ^ +c ')- 



This remains always finite, but changes from a large quantity of one 
sign to a large quantity of opposite sign as e passes through the value 
(2s+l)/M£l<i)ri. The graph of f2 (T) in the case where £, rj and [i are all 
positive will therefore be as represented by curve No. 2 in Fig. 11*7*1. 
Hence a new group of real roots T has been gained, situated close to 
the values n/o), Sn/a), etc. If T s+1 denotes the (s -b l)th of the new set of 
roots, then with e — (2s-j- l)pen£,I ojtj this root is approximately 

.(4) 

The corresponding value of F(\T 8+1 ) given by (3) reduces to 


F(W e+1 ) = 


2<DT)(r} + i£) 
( 24 + 1 )^* + ^)’ 
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and, if the important terms only are retained in (11'6*4), the initial 
displacement in q m is approximately given by 

*•«•>«• - f±£w 

= 4ijl(28 + l)fiir. 

Similarly, the initial acceleration is approximately given by 
4mV 0 )l R m = ~ 4<t)hl/(28 + 1)/17T. 

It follows that the conditions (C) of § 11-6 for a positive initial dis¬ 
placement and a negative initial acceleration are both satisfied if 

V l/*>0- .( 5 ) 

It is to be noted that when /i is very small (and tj + 0) the initial dis¬ 
placement q m (t 0 ) is very large. 

With regard to condition (D) of §11-6, when the.selected semi- 
periodic time is T 8+1 equation (11-6-7) contains only two important 
terms and reduces approximately to 

——sin <o(t — 1 0 ) as 0, 

(2a+ 1)/in v 0/ 

and the lowest root is thus t — t 0 = n/o). It may be concluded that 
equation (11-6-7) necessarily possesses a root t —1 0 lower than any of 
the set T 8+1 except when 5 = 0. Hence, as regards this particular set 
of roots, the lowest—namely T x —is the only one which actually leads 
to steady oscillations. 

11*8. Stability of the Steady Oscillations. The stability of the 
steady oscillations when slightly disturbed can be investigated as 
follows. Let T denote the semi-periodic time for the undisturbed 
oscillations: thus T is assumed to be a root of (11-6-3). Also let A r be 
the value of the typical reducing variable at the starting instant of any 
down-stroke in q m in the steady oscillations, so that (see (11-6-2)) 

= ^m7r e TX r+ i’ .(*) 

for r= 1,2, ...,n. 

For simplicity adopt t 0 = 0 as the datum starting instant and assume 
this to be appropriate to a down-stroke in q m . Then in the undisturbed 
motion the successive terminal instants are 0 , T, 2T, ..., sT. 

.Now assume a small disturbance to occur at t = 0, and in the dis¬ 
turbed motion let the successive terminal instants for the strokes 
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become t x = T + e v t % — 2 T + e 2t t 8 = sT + e g . Lastly, let 2 R m y r 8 r 

be the increment of the typical initial reducing variable in the dis¬ 
turbance, so that for the disturbed motion 

«r(<o) = -4 r + 2R m y T S r . 

Only first-order terms in e and 8 will be retained in the analysis. 

The formulae obtained in § 11-4 may now be used with the simpli¬ 
fications r = t 0 = 0. To illustrate the treatment, consider firstly the 
equation which gives the terminal instant t x in the disturbed motion 
(see (C) of § 11-4). This is in the present case 

£ Kk mr eWJ(-A r -2R m y r S r +R m y r ) = 0, 

r=l 

and on substitution for A r from (1) and expansion of the exponential 
we have approximately 

2Rm Sl Xrkmr7reT ^ 1 + €l Ar) (eTAr V i ” = 0- 

The terms independent of e and 8 may be omitted in view of (11*6*3) 
and (11*4*3). If we write for brevity 

e T *rsx r , A •k mr y r l(l + e T *r)mP rt A r k mr y r d r =Q r , 

the equation gives to first order 

e x J:P r x r -J:Q r x r = 0, 

where the summations are taken for all the n roots A r . 

A similar treatment of the more general equation (11*4*14) yields 

2 P r x r {<f> 3 -<t> s -\X r ++... + (-1 ) 8 - 2 <f> 2 K~ 2 +(- i) 8-1 

+ (-l)*S^ = 0, 

in which <f> 8 = e 8 — e 8 _ x for s > 1 and <f> x == e v The s relations which serve 
to determine the values of <f> v (j> 2 ,<f> 8 are accordingly 

5jP r x r <f>i = 2 Q r x r , 1 

2 P T x T {<}> a - xrfi) = - £ Q r x*, 

ZPrXrWa-Xrfa + ^M = 2 QrA> 


(2) 


2 P r*r(& - *r&-l + At,-* +••• 

+ (-l)*- 2 a --V 2 +(-l) 8 - 1 ^-Vl) = (-l) 8 " 12 <? r ^ 
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These can be solved by the following artifice. Let z ly z 2 ,..., z nHL denote 
the roots of the algebraic equation in z, 

(z+ajj) (z+ar a )... (z+z n ) 2^j^~- == 0. 

Then, for a general value of the variable z, we have 


2 Q r x r l(z+ ay) (z+a^) (z + s a )... (z+ xj 2 Qr x rK z + «V) 
S P T x r j(z + x r ) Const, x (z—Zj) (z - z 2 )... (z- z n _ x ) 


The numerator and the denominator of the rational fraction on the 
right are both of degree n — 1 in z, and by resolution of the fraction into 
simple partial fractions we can therefore derive an identity of the form 


V Qr X r l bt . Ei . . 

z+x r \ 0 + z—z 1 z —z 2 


+ 


Z-Zn-1 


) 


V P r x r 
z + x r ' 


(3) 


where E Ql E Xi ..., E n _ ± are constants which we shall not require to 
determine in the present discussion of the stability. Now if the modulus 
of z is assumed sufficiently great, the expressions on the left and on the 
right of (3) can legitimately be expanded in powers of 1/z to give 


sMl-*44+-) 


Hence, on collecting together the coefficients of the separate powers of 
1/z and equating the results to zero, we obtain the sequence of relations 


T,P r x r E 0 = Z,Q r x r> 

2 P r x r[ ^ Ei - = - 2 Q r xl, 

2 P r x r { 2 E i z i -x r 2 Ei + x 2 r E 0 ) = 2 ^, 4 , 

u-i t=i / 

and so on. A comparison with (2) shows immediately that <f> ± = E 0 , 
and that for s > 1 n _ x 

( j >8 = € 8 -e 8 _ 1 = S EitfT 2 - .(4) 

i=i 

It may be concluded from (4) that if every modulus | z i | is less than 
unity, then <fi 8 tends to zero as s increases indefinitely; but that if any 
modulus exceeds unity, then <j> 8 tends to grow large. Now the time 
interval for the 5th stroke in the disturbed motion is T + <f> 8 . In the 
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first case therefore the motion regains its original constant semi¬ 
period T, whereas in the second case the motion is unstable. The results 
of this investigation may be summarised as follows: Let z 1 , z z , .. z n _ t 
denote the roots of the equation 

n pTk r 

S A 1 - +e ^ n r (2 + e. TX ») = 0. .(5) 

Then the steady oscillations corresponding to the semi-periodic time T 
will be stable or unstable according as every modulus | z i | < 1 or as any 
modulus | z i | > 1. 

Example 


Case of Complex Pair of Roots with Real Part Small. A case of 
interest is that considered under heading (ii) of § 11*7, where the real 
part fi of one pair of conjugate roots A x = /i + io) and A 2 = fi — io) is 
numerically small. It was there shown that the lowest root of the 

series T 8+1 , namely that root having the value T x = — may 

be expected to lead to genuine steady oscillations provided that rjjfi > 0. 
If this value for T is adopted in (5), the equation contains only two 
important terms and reduces to 

(z 4 - e T i A *) (z -f e T i A *)... (z 4 e T ^) W = 0, 

( z 4 

e Tl(/f+ ^ } + conjugate. 


Now to first order 
and 


e T x (ii+i*>) = --l4^(_tf4 i£\ 
0 ) 1 ] 1 

gTjiji+iu)) ifjjq 

1 4 gTiOt+iw) 4 


Hence after some reduction we find 


W = - 



The approximate values of the roots z i for the case considered are 
accordingly ■*. *a» 

*i = l+'~,z z = ~e“ , .... z n _ 1= —e ".(6) 


Consequently, if the real parts of all the roots A other than A x and 
are negative, then the motion is stable or unstable according 
< 0 or >0. 


S ^ 
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11*9. A Graphical Method for the Complete Motion of 
Binary Systems. The computation of the motion which follows any 
given disturbance is, in general, a matter of considerable difficulty. In 
the special case of a system having two coordinates q v q 2f of which q 2i 
say, is frictionally constrained, the solution can be greatly simplified 
by the use of a graphical treatment, provided that only rough results 
are required. Some remarks follow regarding the underlying principles 
of the method. 

The problem for solution is the description of the complete motion 
which results from any given initial disturbance. For simplicity we 
shall assume that this disturbance is imposed at the starting instant 
t 0 of a down-stroke in q 2 , so that initially q 2 (t 0 ) = 0. Attention will be 
restricted to the case in which the four roots of A(A) = 0 are complex. 
For the purpose of the graphical method it is convenient to denote 
these roots by Al - /i + i<o, A 2 = /*- ia>, 

A 3 = /j' + io )', A 4 = /j' — i(j)\ 
and to write A x k 21 oc 1 (t 8 )/R 2 = I g + iY 8 == Z 8 , 

A 2 k 22 oc 2 (t 8 )IR 2 = X 8 — iY 8 ~ Z 8 , 

A ^k 22 (x 2 (t 8 ) /R 2 = X 8 4- i Y 8 s Z 8i 

\k 2 iCti(t 8 )IR 2 = — i F 8 = Z a , 

together with A x k 2l y x s £ + irj = £, 

^ 2^2272 = £~^ = ^ 

Then if r B = t a — t 8 ~ t denotes the time interval for the sth stroke in q 2 , the 
recurrence relation (11-4-11) yields for the special system considered 
the four scalar equations 

Z 8 + (-l)‘£= (^ + (-1)^)^^, 
z s +(- iy£ = (Z^+ (-1)«£) 

.(1) 

z' a +(- 1)«r = (Z's-i +(-1 y n 

z'+(- i)-r = 


while the condition (11-4-3) requires that 

£ + f = 0. .(2) 

Further, since g 2 (f,) = 0 it follows from (11-4-13) that 

X'+X'.-O. .(3) 
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The foregoing relations may be interpreted geometrically by the use 
of two pairs of diagrams. Figs. 11*9*1 (a) and ( 6 ) represent the pair 
appropriate to the unaccented symbols, while two similar diagrams, 



referred to hereafter as Figs. 11 *9* 1 (a') and ( 6 ') but not actually drawn, 
would relate to the accented symbols. A description of Figs. 11*9*1 (a) 
and ( 6 ) follows. 

In Fig. 11*9*1 (a) the points D and U have respectively the co¬ 
ordinates (£,7/) and (-£, — 97 ), and are respectively marked “down” 
and “up”; while the points p » P» etc. have the coordinates (X 0 , Y t ), 
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(X v F*), etc. Now the equations ( 1 ) corresponding to the first down- 
stroke are Z ± -£ = (Z 0 -£) e^+^i. 

Hence, in the diagram, DP X = DP 0 e^ T i and the angle P 0 DP X = 
Again, the equations corresponding to the ensuing up-stroke are 

2 2 + £ = (Zi + 4')e<^^, 

+ f = (Z, + £)^-^2, 

so that C/P 2 = Z7ijC^ r a and the angle P x UP 2 = <ar 2 . 

More generally, DP^+i = DP^e^+i with the angle P 23 DP 2s+1 = c^^+i 
for the typical down-stroke, while UP 2a + 2 = UP 2s + x e^w+a with the 
angle P^+i UP^ 2 = wr 2s+2 for the typical up-stroke. These rotations 
and expansions of the successive radii can be effected conveniently by 
the use of the supplementary diagram Fig. 11-9-1 (6), which depends 
solely upon the first pair of roots A x , A 2 . The diagram consists of one 
or more complete turns of the logarithmic spiral 

x + iy = e^ ioi)t 

or r = e^, 

0 = (i)ty 

where t (time) is regarded as a variable parameter. The number of 
turns of the curve, and the choice of the scale, must be such that the 
radius vector r in Fig. 11*9-1 (b) embraces the range of values of the 
radii DP and UP to be covered in Fig. 11*9*1 (a). A scale for the time 
parameter t is marked along the arc of the spiral. 

The complementary pair of diagrams Figs. 11*9*1 (a') and (&') 
would be similar to the two just described, but would relate to the 
accented symbols. In view of the condition (2) the abscissae for the 
homologous centres D, D' (or £7, U') in the two displacement diagrams 
Figs. 11*9*1 (a) and ( a ') will have equal magnitudes but opposite signs. 
Again, the condition (3) —which is equivalent to the 5th terminal 
equation—requires that the abscissae of the homologous points 
P 3 , Pg shall also be equal and opposite. 

The actual manipulation of the two pairs of diagrams may now be 
explained. For the two displacement diagrams Figs. 11*9*1 (a) and (a') 
transparent graph paper is used; whereas the two spiral diagrams 
Figs. 11*9* 1 ( 6 ) and ( 6 ') are preferably prepared in ink on white unruled 
paper. It should be noted that the positions of the centres D, U and 
JD', U' in the two displacement diagrams are fixed by the dynamical 
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constants and are independent of the initial disturbance. The first step 
in the actual solution is to mark in Figs. 11-9-1 (a) and (a') the positions 
P 0 ,P'o, corresponding to the chosen disturbance: since the disturbance 
is assumed to occur at the starting instant of a down-stroke in q 2 , 
the abscissae of the initial points will be equal and opposite. To find 
the positions of P x and P[ hold sheet 11-9-1 (a) superposed on sheet 
11-9-1 (b) with D registered on the pole Q of the first spiral and P 0 on 
the arc of the curve; similarly, hold sheet 11-9-1 (a') superposed on 
sheet 11*9*1 (6') with D' registered on Q' and P' 0 on the second spiral. 
Then travel continuously in the anticlockwise direction along the 
spirals to the first positions P v P[ for which the increments of the 
parameter t are the same (r x ) in the two diagrams, and the abscissae 
are again equal and opposite. To find P 2 , P 2 from P v P' v the procedure 
is similar, but the up-centres U, U\ instead of the down-centres D, D\ 
are in this case registered on the poles of the spirals. Two observers— 
one for each pair of diagrams—are necessary. 

It is possible to obtain the full history of the complete motion 
resulting from an arbitrary disturbance by the use of the foregoing 
method. The velocities and displacements at any time t , not neces¬ 
sarily a terminal instant, can clearly be deduced by expressing the 
formulae (11*4*9) and (11*4*10) in terms of the current coordinates of 
the points P(t), P'(t) along the spirals. In particular, the value of the 
velocity q 2 (t)/R 2 at any stage is given by twice the sum of the current 
abscissae. A numerical illustration is given in example (iv) of § 12*9. 

It should be noted that the condition (11*4*15) for ankylosis at the 
terminal instant t s is expressible in the present modified notation as 

2( - 1 ) s (/ iX g - wY s +n’X’ s -<j)'Y' s ) + A-lJpo > 0, 
where A U A^-A l2 A iV 

This inequality can if necessary be tested from time to time as the 
work proceeds. An indication of the occurrence of ankylosis is, however, 
provided by the graphical method itself. For, in any genuine down- 
stroke, the starting acceleration in the frictionally constrained co¬ 
ordinate must be negative, while in any genuine up-stroke it must be 
positive. At the starting instant itself the velocity is zero. Hence, if, 
for instance, a down-stroke is due, and if after one time step, or fraction 
of a step (say St) the sum of the current abscissae of P and P' is negative, 
then SqJSt is negative and the stroke is realisable: whereas, if Sq 2 jSt is 
positive, ankylosis is indicated. 



CHAPTER XII 


ILLUSTRATIVE APPLICATIONS OF FRICTION 
THEORY TO FLUTTER PROBLEMS 


12*1. Introductory, {a) Flutter of Frictionless Aeroplane Struc¬ 
tures . In the simple theory of flutter* it is assumed that no solid 
friction is present in the aeroplane structure and that linear laws remain 
applicable throughout the motion. In this ideal case the ordinary 
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critical speed for flutter of any given type is defined to be the lowest 
forward speed of the aeroplane for which free oscillations of that type 
are steady (see § 9-8). For any speed below this critical value the oscil¬ 
lations resulting from any given small initial disturbance eventually 
die away; at the actual critical speed the motion tends to become 



* See for instance Ref. 30. 
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simply sinusoidal; while for all speeds over a certain range whose 
lower limit is the critical speed, oscillations occur which increase to 
an indefinitely large amplitude, however small the initial disturbance 
may be. The term critical speed will in the present chapter be used 
with the foregoing significance, and will, without further specific 
qualification, always refer to the ideal frictionless aeroplane. It is 
clear that at the critical speed for flutter the determinantal equation 
will have at least one pair of conjugate purely imaginary roots. It 
will be convenient to refer to one or other of these particular roots as 
the critical root . 

The flutter characteristics of an aeroplane can be represented by 
means of two diagrams, showing respectively the variation of damping 
factor fi (the real part of A = /i ± io)) and frequency a>/27r with airspeed, 
for the several constituents of the motion. Figs. 12*1*1 and 12-1-2 are ex¬ 
amples of a pair of such diagrams.* They relate to the rudder-fuselage 
flutter of an aeroplane which is referred to as aeroplane No. 1 in the 
sequel. The curves marked p and <o correspond to an oscillatory 
constituent which is damped for all speeds less than the critical value 
V c = 240 feet per second, and grows indefinitely for all higher speeds. 
The second constituent p ', o) f remains a decaying oscillation through¬ 
out the range of speeds covered by the diagrams.f 

It is possible for an aeroplane to be free from flutter throughout the 
range of its flying speeds, but to have a very small margin of stability 
in the vicinity of a particular flying speed. Fig. 12-1-3, which is purely 
diagrammatic, illustrates this condition. 

(6) Flutter in Practice . In the practical sense “flutter” means an 
oscillation which grows, and finally either breaks the structure or 
remains bounded at some amplitude whose value is dependent upon the 
departures from linear laws. As pointed out in § 11-1 (e), solid friction 
introduces a very special departure from linearity. 

(c) Applications to be Considered . The numerical examples to be 
given are restricted to tail oscillations involving angular displacement 
of the rudder and torsion of the fuselage. Moreover, they relate mainly 
to the simplest aspect of the theory—namely, the question of steady 
oscillation with either the rudder or the fuselage frictionally con¬ 
strained. 

* The curves drawn are based on data given in Chap, v of Ref. 42. 

t At a speed of about V = 580 this constituent in fact degenerates into a pair of sub¬ 
sidences. 
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It will be useful to review briefly the conditions for the existence and 
stability of steady oscillations in these applications. 

In accordance with the notation of § 11*7 we shall suppose the fric- 
tionally constrained coordinate (rudder or fuselage) to be q 2 , and 
E> + ii) to be the value of \h 2r y r for the critical speed, where A r is the 
critical root. If this root becomes fi 4- ico when the airspeed is slightly 
changed, then ji will be small and will be positive or negative according 
as the airspeed is above or below its critical value. It follows from 
equations (11*7*5) and (11*8*6) that if rj > 0, then for a certain range of 
the speed above the critical steady oscillations will be possible, but 
wall be unstable and therefore not realisable in practice. On the other 
hand for speeds below the critical, since /1 < 0, stable steady oscillations 
■will be possible provided that rj < 0. The particular oscillations here 
considered are those appropriate to the root T x of the semi-period 
equation which lies adjacent to the first asymptote (see Fig. 11*7*1). 

Unfortunately, the numerical data available for the calculations are 
very scanty. The first example given is that of an actual aeroplane 
(aeroplane No. 1), for which, when solid friction acts either on the 
rudder or the fuselage, steady oscillations are theoretically possible 
only at speeds above the critical; as already explained, these oscilla¬ 
tions are unstable. Aeroplane No. 2 is an artificial system, in the sense 
that its dynamical constants have been derived from those of aeroplane 
No. 1 by a transformation which changes the dynamical constants of 
the tail system, but leaves the roots of the determinantal equation 
unaltered at all speeds. Hence the tail flutter characteristics of the 
two aeroplanes are effectively identical. Nevertheless, when friction 
is present, the behaviour of aeroplane No. 2 is different from that of 
No. 1. On the modified aeroplane, if the friction acts on the rudder 
only, stable steady oscillations can occur at speeds below the critical. 
The graphical method of § 1 T9 is applied to provide a description of 
the complete tail oscillations under representative initial disturbances. 
The final example relates to another artificial system (aeroplane No. 3) 
which is completely stable at all speeds when frictionless, but which is 
shown to admit bounded oscillations at a particular speed when friction 
is introduced. 

A rather more detailed summary follows of the main conclusions 
for the three different aeroplanes considered. 

(d) Characteristics of Aeroplane No. 1 (see Part I). The numerical 
data for this case are appropriate to a full scale aeroplane, the tail 
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flutter characteristics of which have in fact been investigated by 
means of a model in a wind tunnel. From Fig. 12-1-1 it will be seen 
that the critical speed is about 240 feet per second. 

The results when the friction acts on the rudder only are as follows: 

(i) Speeds below the critical . The motion dies for all disturbances. 

(ii) Speeds above the critical . Large disturbances produce flutter and 
small ones decaying motion. Steady oscillations (constant amplitude 
and frequency) could result from a special initial disturbance, which 
varies with the airspeed and is directly proportional to the limiting 
friction. The steady oscillations appropriate to any particular speed 
are unstable, so that any slight variation from the correct initial 
disturbance will give rise either to a decaying motion or to flutter. 
Figs. 12*3*2 and 12*4*1 show the steady oscillations for the airspeeds 
260 feet per second and 500 feet per second, respectively. 

When the frictional moment is applied to the fuselage only, the 
results are similar. 

(e) Characteristics of Aeroplane No. 2 (see Part II). Aeroplanes 
Nos. 1 and 2 have the same damping-factor and frequency diagrams, 
and therefore effectively the same flutter characteristics. But their 
responses to disturbances at speeds below the critical are quite 
different in the case where the friction acts on the rudder. On 
aeroplane No. 1, if the airspeed is less than the critical value 240, the 
oscillations decay. This also occurs with aeroplane No. 2, provided the 
airspeed is less than about 215. On the other hand, for any speed 
within the range 215-240 two distinct sets of steady oscillations are 
possible, one (small amplitude) of unstable type and the other (large 
amplitude) of stable type. Consequently, very small disturbances 
produce motions which die; moderate ones give rise to oscillations 
which grow and finally tend to the large amplitude stable steady motion: 
still greater disturbances result in oscillations which decrease to the 
same stable motion. 

The histories of the motion due to three different initial disturbances, 
for an airspeed of 230, are represented in Fig. 12*9*2. Curve B is the 
unstable (small amplitude) steady motion, the angular amplitude of 
the rudder in this motion being about 0*039 degree per foot-pound of 
(limiting) frictional hinge moment applied to the rudder. The corre¬ 
sponding amplitude for the stable steady oscillations (not actually 
shown in the diagram) is 0*278 degree per foot-pound of friction. Any 
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disturbance similar to, but less than, that required to initiate oscilla¬ 
tions J3, gives rise to a motion which decays (e.g. oscillations A): 
whereas any similar disturbance greater than that corresponding to 
motion B produces oscillations which increase to a limiting amplitude 
of 0-278 degree per foot-pound of friction (e.g. oscillations C ). A maxi¬ 
mum amplitude increase of about 7:1 is thus to be expected at the 
particular airspeed considered. 

The theory shows that the amplitude of the stable steady oscillations 
increases very rapidly as the critical speed is approached. On the 
other hand, the amplitude of the unstable set is not greatly affected. 
Hence the effect of the friction in the present instance virtually amounts 
to a slight, and somewhat indefinite, reduction of the critical speed. 

(/) Characteristics of Aeroplane No . 3 (see Part III). This aeroplane 
is free from tail flutter at all airspeeds, but its margin of stability is low. 
The feature of interest is the fact that at an airspeed of 230, its oscil¬ 
latory characteristics, when friction is present, are identical with those 
of aeroplane No. 2. This shows the possibility of bounded free oscilla¬ 
tions occurring on an aeroplane which has no true critical speed. 


Part I. Aeroplane No. 1 

12 * 2 . Numerical Data. The system here considered is the aero¬ 
plane of which the tail flutter characteristics are discussed in Chapter v 
of Ref. 42. The two degrees of freedom* correspond to angular dis¬ 
placement q 1 of the tail unit in torsion (measured positively when the 
starboard tailplane moves downwards) and angular displacement q 2 
of the rudder (measured positively when the rudder trailing edge moves 


Table 12-2-1 

Dynamical Coefficients for Aeroplane No. 1 


Fuselage (g x ) 

Rudder ( q a ) 

Coefficient 

Value 

Coefficient 

Value 

An 

44-7 


-115 

Bn 

1*77 V 

B tl 

0*041 F 

On 

33,700 

On 

0 

A t , 

-1*15 

A n 

0*745 

Bit 

-O-180F 

Bn 

0*034 F 

On 

- 0*101 F* 


0*00358 F a 


* The coordinates q l9 q t correspond respectively to Q and f of Chap, v of Ref. 42. 
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to port). The dynamical coefficients appropriate to the frictionless 
aeroplane are reproduced in Table 12-2-1 from Table 43 of Ref. 42: 
for simplicity the gravitational cross stiffness is neglected. 

Except where otherwise stated angles are assumed to be measured 
in radians, moments in pounds feet, stiffnesses in pounds feet per 
radian, and airspeeds F in feet per second. 

The determinantal equation for a general airspeed F is 

Po^+Pi^+P^+Pz^+P* = 0 , .( 1 ) 

where p 0 = 31-979, 

= 2-6717F, 

p 2 = 25106-5+ 0-111682F 2 , 
p 3 = 1145-8F +0-0104776F 3 , 
p 4 = 120-646F 2 . 

Table 12-2-2 gives the calculated roots for a number of different air¬ 
speeds: the four roots are denoted by 

A x = fi + io ), A 2 = p—ico, A 3 = p' + iu)', A 4 = p! — ia>'. 


Table 12-2-2 

Roots of Determinantal Equation for Aeroplane No. 1 


(Also applicable to Aeroplane No. 2) 


V 


a i 

V-' 

a/ 

0 

0 

280195 

0 

0 

100 

-1*6932 

27*3904 

-2*4838 

6*6277 

200 

-1*3826 

25*5809 

-6*9719 

13*4658 

230 

-0*3590 

25*4747 

-9*2487 

14*8973 

239*807 

0*0 

25*5811 

- 10*0174 

15 2049 

260 

0*6868 

25*9749 

-11*5477 

15*6327 

280 

1*2684 

26*5275 

-12*9647 

15*8515 

300 

1*7600 

27*1810 

-14*2929 

15*9176 

400 

3*1912 

31*2253 

-19*9002 

14*7198 

500 

3*7493 

35*9271 

-24*6357 

10*7665 


The critical speed for flutter is thus 239*8. 

12*3. Steady Oscillations on Aeroplane No. 1 at V = 260. 
(Rudder Frictionally Constrained.) The calculations for this case 
will be explained in some detail. The dynamical coefficients, as deduced 
from Table 12*2*1 with V — 260, given in Table 12*3*1. 

Further (see Table 12*2*2), 

A 2 =/£ dh = 0*6868 + 25*9749i, 

A 3 , A 4 ± iu' = - 11*5477 ± 15*6327i. 
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Table 12-3-1 


Dynamical Constants for Aeroplane No. 1 atV = 260. 


Fuselage (g 2 ) 

Rudder (q t ) 

Coefficient 

Value 

Coefficient 

Value 


44-7 

An 

-1*16 

Bn 

460-2 

B tl 

10-66 

c„ 

33,700 

0 tl 

0 

A lt 

-1-16 

A n 

0-746 

B„ 

-48-36 

B n 

8-84 

c» 

-6827-6 

£** 

242-008 


The values of the constants X r k 2r y r are given by (11-4-1), and the 


following formula is typical: 

^i^ai7x = (i) = (i) (AiA!+JB u Ai + C' u ).(1) 

A(A X ) A(Aj) 

The results are 

KKi7i = i + iy = (7-81516+ 9-38083i)10- 3 .(2) 

A 3 * 23 y 3 = i' + iy' = (-7-81516+ 35-23619i)10-®, .(3) 


while X 2 k 22 y 2 and X i k 2i y i are respectively the conjugates of (2) and (3). 
It should be noted that an immediate check of the accuracy is here 
provided by the condition (see (11-4-3)) 

2 A r fc 2r y r = 0. .(4) 

r—1 

The next step is the solution of the semi-period equation (11-6-3), 

11111111617 4 e TX r— 1 

0- . (6) 

e TX i _ i ^ sinh Tfi + i sin To) 

° W e TX i + 1 cosh T/i + cos To) 9 

and the expression appropriate to the root A 3 is similar. Hence (5) 
is reducible to 

Z1 s * n s * n k Tj^L ~ V ’ 8 * n _ a 

~ cosh T/i -f cos T(i) + cosh T/i’ + cos To)' ~ 

Curve No. 2 of Fig. 12«3-1 is part of the graph of Q(T) plotted against T. 
The lowest root of the set discussed in §11*7 works out, on close 
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approximation, to T x = 0*11856. Hence 

e T ih — l 
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e T i A i +1 
e T i A »— 1 


= 15*56+11*81*, 

= -1*0136 + 0*5293i. 


e T i A »+1 

The initial displacement q 2 (0)/R 2 and initial acceleration q 2 (0)/R 2 are 
next calculable by the formulae (11*6*5) and (11*6*6), namely 

4 


£2(0) _ y k v eTA '~ l 

R 2 


*2 

M °) 

R« 


4 e TX r-l 

= d 22 + ^2r 7r ~TA r ] * 


.( 6 ) 

•(?) 



Fig. 12-3*1 

From (2) and (3) 

*2i7i = (368*847-29M224) 10 - 6 , .(8) 

k 22 y 2 = (1697*198-753*774i) 10~ 6 , .(9) 

Af*2i7i = - 0*238299+ 0*209441i, 

X\k 23 y 3 = - 0*460581 - 0*529072i. 

Again a 22 = A n /p 0 — 1*39779. 

At this stage several checks on the accuracy are possible. Firstly, from 
(11*4*2) and the definition of the constants <j> it follows that 


02 — Qll/Pl - 2 &2r7r* 
r*» 1 
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The value of C? u /p 4 works out as 4132*09 x 10 -6 , which is in exact 

4 

agreement with the value of 2 &2r7r given by equations (8) and (9). 

r=l 

Secondly (see (11*4*4)), 

4 

2 r7r = “~ a 22 ==: “~-^ll/Po> 

r—1 

and the calculated values are — a 22 = — 1*39779 and 

4 

2 KK7r = - 1-39776. 

r—1 

The numerical checks are thus satisfactory. 



From equations (6) and (7) we find that 

g 2 (0)/i? 2 =+0*015712, 

?2(0)/£ 2 =-9*4713. 

Hence conditions (C) of §11*6 are satisfied. Conditions (D) require 
that the velocity q 2 (t) shall not vanish within the interval t = 0 to t = T. 
To decide this question we shall determine the actual motion of the 
rudder (j a ) during the first stroke. The displacement at any time t 
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during this stroke is 


£2(0 

R 9 


4 / 2e ,A ’- \ 

= r ? 1 * 2rrr ( 1_ I+^)> 


where T = 0-11856; while the velocity is given by the formula 


? 2(0 


e u r 




The results, which are shown plotted in Fig. 12-3-2, show that all the 
required conditions are satisfied. The dotted curves correspond to a 
simple harmonic motion, and are added for comparison. 

Equation (11-8-5) reduces in the present case to the cubic 
17-0109z 3 — 19-9916z 2 4- 2-50234z — 1 = 0. 


There is one real root z 1 = 1-0897, and two conjugate complex roots 
having the modulus 0-2322. Since | z x | > 1, the oscillations are unstable. 
It may be noted that the value of z x agrees quite well with the approxi¬ 
mate value 1-083 given by the formula (11-8-6). 


12-4. Steady Oscillations on Aeroplane No. 1 at Various 
Speeds. (Rudder Frictionally Constrained.) A summary of results 
for other airspeeds will now be given. Table 12-4-1 gives the values of 
the constants A^iTi and A 3 £ 23 y 3 . 

Table 12-4-1 


Values of \ 1 k 21 y 1 and h$k 22 y 2 for Aeroplane No. 1 
(Rudder Frictionally Constrained) 


V 


»n * 103 

239-807 (critical) 
260 

300 

600 

9-6379+ 6-8777i 
7-8152+ 9-3808t 
4-0923 + ll-2649i 
-2-5033+ 9-8069i 

- 9 6379 +40-7463i 

- 7-8152+ 35-2362t 

- 4-0923+ 28-7984» 

+ 2-5033+25-589H 


The graph of the function Q(T) appropriate to the critical speed 
= 239-8 is curve No. 1 of Fig. 12-3-1, and it has asymptotes 
situated at the values T — 7r/o>, Sn/a), etc., where njoj = 0-12281. 
The lowest non-zero root of the semi-period equation in this case is 
T = 0-32848, but the oscillations corresponding to this root can be 
proved to be spurious, since q 2 (0) and q 2 ( 0) are both positive. 

For a speed, say V c + e> very close to the critical value, the lowest 
non-zero root lies adjacent to the first asymptote of curve No. 1 and 
its value is approximately 

T = 0*12281 - 0-00023*?. 
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In this case, if terms in 1/e only are retained, 

e TA i -1 260+230* 

e TA »+l - e 

g a ( 0) 0-308 
ii 2 “ e * 

g a (0) 201-6 

e * 



Fig. 12-4-1 


Hence steady oscillations can occur if e is positive (i.e. for speeds just 
above the critical). However, if e is very small, the initial disturbance 
required will be correspondingly large, and the oscillations themselves 
will approximate closely to the sinusoidal type. 

Curve No. 2 has already been discussed in § 12-3, while curves No. 3 
and No. 4 relate respectively to V = 300 and V = 600. It may be 
noted that all the curves have the same slope (— £a 22 ) the origin 
T = 0. The calculated values of g 2 (0)/l? 2 and of <j , 2 (0)/.fi 2 for the lowest 
root T in the several cases considered are as follows. 
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Table 12-4-2 


V 

Lowest 
root T 

9.(0)/*, 

9.(0)/*, 

Remarks 

239*8 (critical) 

0*32848 

+ 0*005675 

+ 1*4624 

Condition (C) of 
§ 11*6 violated 

260 

0*11856 

+ 0*015712 

-9*4713 

) Theoretically 

300 

0*11340 

+ 0 006801 

-3*6527 

[■ possible, but 

500 

0*09111 

+ 0 002851 

-1*7573 

j unstable 


The first down-strokes in the rudder motion corresponding to the 
two cases V = 260 and V = 500 are shown compared against sinusoidal 
oscillations in Figs. 12-3-2 and 12-4-1. It will be seen that the deviation 
from the sine curve is quite appreciable in the case V = 500. 

12-5. Steady Oscillations on Aeroplane No. 1. (Fuselage 
Frictionally Constrained.) The oscillations when the fuselage is 
frictionally constrained have not been investigated in detail. To 
render the formulae of §11-6 directly applicable, the coordinate q 2 
must in this case be chosen to refer to the fuselage and the coefficients 
in Table 12-2-1 must be correspondingly transposed. The value of 
Al& 2 i 7 i appropriate to the critical speed works out as 
£ + i v = (-128-28 + 329-55i) x 10- 6 . 

Since r} > 0, the steady oscillations occur at speeds above the critical. 

Part II. Aeroplane No. 2 

12-6. Numerical Data. The complete set of dynamical coefficients 
required for an application of the theory to rudder-fuselage oscillations 
has only been measured for the one aeroplane already considered. In 
order to provide an illustration of a system which exhibits steady 
oscillations below the critical speed, it has been necessary to adopt 
rather arbitrarily a new set of coefficients. These have been derived 
by postmultiplication of the inertial, damping, and stiffness matrices 
appropriate to aeroplane No. 1 by a non-singular matrix u of suitable 
constants. In this case, if A, B, C refer to the modified system, we shall 
have A\ 2 +BX+C — (4A 2 + B\+C) u, 

and the roots of the determinantal equation for each airspeed will 
accordingly be unchanged by the transformation. The elements of u 
must, of course, be chosen such that the symmetry of the inertial and 
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elastic stiffness matrices is preserved and that the discriminants of the 
kinetic energy for the modified system are all positive. Moreover, in 
order that steady oscillations shall be possible on the modified system 
at speeds below the critical, we require 7/ < 0. By an application of 
(12*3-1), it is easy to show that this last condition can be expressed as 
(C n — A u o) 2 ) ( fi ' 2 + o)' 2 — (o 2 ) — 2(o 2 /i'B n < 0. 

By trial a suitable post multiplier is found to be 


u = T 1-0 0 1. 

_-10*0 7-47826J 


The dynamical coefficients for the modified system are summarised 

in Table 12-6-1. _ _. „ , 

Table 12*6*1 


Dynamical Coefficients for Aeroplane No. 2 


Fuselage (q x ) 

Rudder (g 2 ) 

Coefficient 

Value 

Coefficient 

Value 

A n 

56-2 

An 

-8-60 

B,i 

3-63 V 

B n 

- 0-299 V 

c„ 

33,700 + 1 01 

c„ 

- 00358 F* 

A lt 

-8-60 

A„ 

5-57130 

B lt 

-1-39096 V 

Bn 

0-25426 V 

c» 

- 0-75530 P 

Cn 

0-026772 V s 


The coefficients of the new determinantal equation are the same as 
those for equation (12*2*1), multiplied by the constant factor 7*47826. 


12-7. Steady Oscillations on Aeroplane No. 2. (Rudder 
Frictionally Constrained.) For the critical speed V c = 239*8 it is 
found that 

= ^ + ^ = (7*88129— l*41363i) 10~ 3 , 

A 3 *2373 = ^ + ^' = (“ 7*88129 + 15*29856i) 10~ 3 , 
and since rj < 0 stable steady oscillations are possible below the critical 
speed. 

Curve No. 1 of Fig. 12*7*1 shows the graph of ti(T) corresponding 
to the critical speed. The lowest non-zero root of the semi-period equa¬ 
tion is T = 0*09608, and this leads to the values q 2 (0)/R 2 = 0*000551 
and q 2 ( 0 )IR 2 = — 0*3474. The corresponding steady oscillations can be 
shown to be unstable. 

With V = 230, the calculated constants are 

£+i v = (7*51056-2*103011) 10~ 3 , \ 

£' + i v ' = (-7-51056 + 15-96533i) 10-»J . ^ 
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and the semi-period equation is found to have for two of its roots the 
values T x = 0*09937 and T 2 = 0*11607 (see curve No. 2 of Fig. 12*7*1). 
The initial displacement and acceleration corresponding to the lower 
root Tj are 

? 2 (°)A B 2 = 0*000681 and g 2 (0)/fi 2 = -0*4385; 
while those appropriate to T 2 are 

q 2 (0)/R 2 = 0*00485 and q 2 {0)/R 2 = - 3*312. 

In both cases condition (D) of § 11*6 is also satisfied.* An application 
of the criteria for stability shows that the first set of oscillations are 
unstable and the second set stable. This suggests that with aeroplane 


0*02 

n(r) 

0 


- 0-02 


0 0-05 0-10 T 0-15 0-20 

Fig. 12-7-1 

No. 2, for a certain range of the airspeed below the critical speed, a 
moderately large disturbance may produce oscillations which grow 
to a relatively large constant limiting amplitude, but that a smaller 
disturbance will produce decaying oscillations leading ultimately to 
ankylosis. The limiting amplitude to be expected from growing 
oscillations will increase continuously the closer the speed approaches 
the critical value. 

From Fig. 12*7*1 it is seen that at a speed as low as V = 200 (curve 
No. 3), the two real roots corresponding to T x and T 2 have disappeared, 
so that steady oscillations are no longer possible. Hence at this speed 
the oscillations due to any disturbance eventually die away. 

* This is confirmed by the graphical method in § 12-9. 
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12*8. Steady Oscillations on Aeroplane No. 2. (Fuselage 
Frictionally Constrained.) The particular transformation used to 
derive the dynamical coefficients for aeroplane No. 2 from those of aero¬ 
plane No. 1 is such that the coefficients A 22 , ®22 and C 2 2 in Table 12*6* 1 



Fig. 12*9*1 (a) 

are all proportional to the corresponding coefficients A 22i B 22 and C 22 in 
Table 12-2-1. Since also the roots of A(A) = 0 appropriate to the critical 
speed are the same in the two cases, it readily follows that the conclu¬ 
sions drawn in § 12*5 are equally applicable to aeroplane No. 2. Hence 
when the fuselage is frictionally constrained, the steady oscillations on 
aeroplane No. 2 occur above the critical speed, and are unstable. 

12*9. Graphical Investigation of Complete Motion on Aero¬ 
plane No. 2 at V = 230. (Rudder Frictionally Constrained.) 
In § 11*9 a method is given for the graphical description of the com- 
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plete motion of a binary system. This method will now be applied to 
aeroplane No. 2 for V = 230, which is about 10 feet per second below 
the critical speed. Throughout, the friction is assumed to be applied 
to the rudder only. 



Pig. 12-9-1 (o') 


The appropriate values of A T k 2r y r are (see (12-7-1)) 

-*i*2i7i = (7-51056-2-10301t) 10-*, 

*. s k 23 y 9 = (~ 7-51056 +16-96533*) 10" 3 . 

Hence, in displacement diagram No. 1 (Fig. 12-9-1 (a), corresponding 
to Fig. 11-9-1 (a)), the down-centre D and up-centre U have, respectively, 
the coordinates ( + 7-51 xlO -3 , — 2-10xl0 _3 ) and ( — 7-51 x 10 -3 , 
+ 2-10x 10~ 3 ); while in displacement diagram No. 2 (Fig. 12-9-1 (a')) 
D‘ is the point (-7-51 x 10~ 3 , 15-97 x 10~ 3 ) and V' is ( + 7-51 x 10~ 3 , 
-15-97 x 10- 3 ). 
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Spiral No. 1 (Fig. 12-9*1 (f>), corresponding to Fig. 11-9-1 (6)) is the 

CUrVe r __ g/tf/a g—0*0140945^ 

where 0 is the angle of rotation of the radius r in radians, and 0 = iot . 
For convenience, the arc is divided into equal steps 0 = 10 degrees 



or 7r/18 radians, so that each step represents a time interval of 
t — 7r/18a> = 0-0068512 sec. Spiral No. 2 (Fig. 12-9-1 (6')) is the curve 

— gfi'O'lu)' = g—0*620835' 

where 0' = w't, and its arc is divided into steps representing the same 
time intervals as for No. 1. Thus d'/d = cj'/oj, so that a step in 0 
amounting to 10 degrees corresponds to a step in 0' amounting to 
5-8479 degrees. 
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Examples 

(i) Steady Oscillations Corresponding to T x = 0-09937. This is one of 
the cases discussed in § 12-7. The relevant data are 

A = (5-1738 + 24-1772*) 10~ 3 , 

T A. 1 

A 3 * 23 y 3 ^A~qii = (- 5,1738 - 15 - 7506i ) 10 ~ 3 - 

Hence the positions of P 0 , P' Q (which are fixed by the initial conditions 
for the first down-stroke) are 

P 0 = (5-17 xlO" 3 , 24-18 xlO- 3 ), 

P' = (-5-17X10- 3 , -15-75 xlO" 3 ). 

These points are marked on the displacement diagrams Nos. 1 and 2, 
respectively, and the description of the motion is then begun. The first 
operation is to superpose displacement diagram No. 1 on spiral No. 1, 
with D and P 0 registered, respectively, on the pole and on the curve; 
the second pair of diagrams is superposed in a similar manner. The 
two observers (one for each pair of superposed diagrams) now follow 
the spirals with pointers, proceeding by equal time steps in the 
positive sense, so that in each case the radius vector DP is shrinking. 
In the early stages the abscissae of P, P' (initially numerically equal, 
although of opposite signs) begin to diverge; but after about 14-5 
steps (i.e. about 145 degrees on diagram No. 1 and 85 degrees on 
diagram No. 2), they again become equal and opposite. At this stage 
the velocity again vanishes, and the first down-stroke is complete. 
It is found that the new positions of P and P' (namely P 1 and P[) are 
such that 0 lies on and bisects P Q P l7 while similarly O' lies on and 
bisects PqP [. This shows that the motion is steady. Further, the angle 
of rotation d amounts to 145 degrees, and this corresponds to a semi¬ 
period T x = 0*0993 sec., in good agreement with the value obtained by 
direct calculation. 

(ii) Steady Oscillations Corresponding to T 2 = 0 11607. Here the 
points Po,P’o have, respectively, the coordinates 

P 0 = (4-11 x 10- 3 , 82-14 x 10- 3 ), 

P' = (-4-11 xlO- 3 , -19-00x 10- 3 ). 
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The procedure is similar to that described for the previous case. It is 
found that the stroke is complete after 16*9 steps, and that the motion 
is steady. The value of the semi-period derived by this method is 
T 2 = 0*116. 

(iii) Growing Oscillations following a Particular Disturbance . The 
initial displacements and velocities are here all assumed to be 50 per 
cent in excess of those appropriate to the (unstable) steady oscillations 
considered in example (i).* Thus 

P 0 = (7*76 x 10~ 3 , 36*27 x 10~ 3 ), 

P' = (-7*76xl0- 3 , - 23*62 xl0~ 3 ). 

Table 12*9*1 


Synopsis of Results for Example (iii) (Growing Oscillations) 


Stroke 

no. 

Type 

No. 

of 

time 

steps 

Time 

of 

stroke 


Coordinates of Z 

a 

Coordinates of Z' s 

X g » 

10 8 

Y b x 10 s 

x;xio* 

Y'.xW 

Initial 

Final 

Initial 

Final 

Initial 

Final 

Initial 

Final 

1 

Down 

15-7 

0-107, 

7-75 

-7-2 

36-25 

-35-9 

-7-75 

7-2 

-23-6 

16-3 

2 

Up 

1615 

0-110, 

-7-2 

4-0 

-35-9 

36-7 

7-2 

-4-0 

16-3 

-16-9 

3 

Down 

16-6 

0-107 

4-0 

-4-75 

36-7 

-37-5 

-4-0 

4-75 

-16-9 

17-5 

4 

Up 

16-75 

0-108 

-4-75 

4-75 

-37-5 

38-4 

4-75 

-4-75 

17-5 

-17-5 

6 

Down 

15-7 

0-107, 

4-75 

-4-75 

38-4 

-39-25 

-4-75 

4-75 

-17-5 

17-5 

6 

Up 

15-85 

0-108, 

-4-75 

4-75 

-39-25 

39-9 

4-75 

-4-75 

17-5 

-17-5 

7 

Down 

15-85 

0-108, 

4-75 

-4-75 

39-9 

-40-7 

-4-75 

4-75 

-17-5 

17-6 

8 

Up 

15-9 

0-109 

-4-75 

4-65 

-40-7 

41-5 

4-75 

-4-65 

17-6 

-17-6 

9 

Down 

15-95 

0-109, 

4-65 

-4-6 

41-5 

-42-5 

-4-65 

4-6 

-17-6 

17-6 

10 

Up 

16-0 

0-109, 

-4-6 

4-6 

-42-6 

43-7 

4-6 

-4-6 

17-6 

-17-5 


The results obtained at the terminal instants of the first 10 complete 
strokes of the rudder are given in Table 12*9*1. To obtain a more 
detailed description of the motion it is necessary, as explained in 
§ 11*9, to note the current positions of P and P' on the spirals for a 
number of the individual time steps. The value of the velocity q 2 (t)IR 2 
at any stage is then twice the sum of the current abscissae, while the 

Z(t) Z'(t) 

corresponding displacement q 2 (t) /P 2 is twice the real part of -f . 

A 3 

Alternatively, the displacement can be deduced by integration of the 
graph of the velocity. Curve C of Pig. 12*9*2 shows the complete motion 
determined by the first method for the first nine strokes of the rudder: 

* Note that if the displacements and velocities in any initial disturbance are all increased 
proportionally, then the initial reducing variables will also all increase proportionally. 
Hence, in the present case the vectors OP 0 and O'Pq will be 50 per cent in excess of those 
appropriate to example (i). 
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the growth of the oscillations is obvious. However, this growth cannot 
continue indefinitely, and it is clear that the ultimate motion will be 
the stable steady oscillations discussed in example (ii). The initial 
angular displacement of the rudder is in the present case 0*0585 degree 
per foot pound of frictional moment P 2 , while the maximum displace¬ 
ment is 0*278 degree per foot-pound. The ratio of initial to final ampli¬ 
tude is thus about 1:5. 



(iv) Decaying Oscillations leading to Ankylosis . For this example the 
initial displacements and velocities are assumed to be only three- 
quarters of those appropriate to the (unstable) steady oscillations of 
example (i). The resulting motion is found to be complete for four 
strokes of the rudder; temporary ankylosis then occurs, and eventually 
permanent ankylosis. 

The initial data for the first stage of the motion are 
P 0 = (3*9 x 10- 3 , 18*1 x 10~ 3 ), 

P' = (_3*9 xlO- 3 , — 11*8 x 10“ 3 ), 

corresponding to a rudder displacement q 2 (0)/R 2 of about 0*51 x 10~ 3 . 
The displacements at the stopping instants of the first four strokes are 
found by the graphical method to be respectively 
— 0*215 x 10- 3 , 0*305 x 10~ 3 , -0*131 x 10~ 3 , and 0*087 x 10~ 3 , 

while the variation of velocity q 2 /R 2 during each stroke is determined 
by addition of the abscissae of P, P' for corresponding positions along 
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the spirals. A graphical integration of the velocity diagram, controlled 
by the known values of the displacements at the terminal instants, 
yields the final graph of the rudder displacement (see Fig. 12-9*3) 
during the first four strokes. 

An attempt to continue the graphical operations, with the down- 
centres D, D ' registered on the poles of the spirals on the supposition 
that complete motion continues with a down-stroke, leads immediately 
to the contradictory result that the starting acceleration for the fifth 
stroke is positive. This indicates ankylosis. The measured positive 
starting acceleration does not here mean that complete motion ensues 



with an up-stroke. A genuine up-stroke would be realisable only if 
the acceleration happened to be positive with the up-centres U, V' 
registered on the poles of the spirals: this is not true for the present case. 

In order to determine the ankylotic motion we may use either an 
analytical or a graphical method. The analytical treatment will be 
illustrated here. The differential equation for the ankylotic motion is 


(see §11 5) -^iiSfi + -®nffi + Cii?i “ —Cwfaih)* . (1) 

where q 2 (t A ) is the constant value of the ankylosed rudder coordinate. 
Further, ankylosis will persist so long as the quantity 

(^21 ?1 + ^21?l + ^212l + ^22?2^4))/-®2 .(2) 

is numerically less than unity. 


The first step is to derive the initial displacements and velocities of 
the fuselage in the ankylotic motion from the corresponding known 
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values of the reducing variables: for this purpose the formulae (11-3-1) 
and (11-3-2) can be used. The values of Z, Z' appropriate to the 
stopping instant t i of the fourth stroke in the complete motion are 
Z t x 10 s = 5*8 + 11-82 and Z\x 10 3 = — 5-8 — 12-2i. Hence 

Ai*2i Oi(<«)/i^ = (5-8+ll-8i) 10- 3 , 

*3**3«3(* 4 )/S» = (-5-8- 12-20 10- 3 . 

Also fii = = 0-64057 - 0-09874i, 

*21 /ll(*l) 

^13 = = 0-49702 + 0-054520 

^23 J ll(^3) 

The required initial values, which can now be calculated, are 
?i (^)/^2 = 0*058 x 10- 3 and q 1 (t A )/R 2 = 5*32 x 10' 3 . 

Next, on substitution of the numerical data, equation (1) becomes 
q x + 14*856^+1550*34^ = 62 x 10 ~ 3 R 2 . 

The solution appropriate to the initial values just determined is 
I0 3 xq 1 (t)/R 2 = 0*040 — 0* 142e -7 * 428 * cos 0, 

where <J> (assumed expressed in degrees) has the value 2215*5£ +97*3, 
and t = 0 is temporarily adopted as the starting instant for the an- 
kylotic motion. At t = 0 the quantity (2) has the value 0*5674; this 
confirms the occurrence of the ankylosis. However, at t = 0*0093 
the value is + 1-0, so that at this instant complete motion is resumed. 
The positive sign for (2) indicates that the ensuing stroke in q 2 will be 
downwards. 

To proceed again by the graphical method for complete motion a 
preliminary conversion to the reducing variables is necessary. The 
formula appropriate to the root A x may be written (see 11*3*8) 

{^n a i, ^21 a i} = (T 7 ~~ (Oq — Ai Aq ), 

A 1 A(A 1 ) 

and the numerical data actually required for the computation of 
*i*2i a i are Cq _ f 5493-1 - 5824-02 1, 

[-82-52+ 891-23iJ 
~/2i(*i) = -3710-82+1594-58i, 

/ U (A X ) = 5036-94 + 20240-78i, 

(i) 

-1/A(A 1 ) = (0-11394-0-08755i) 10"«. 
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Hence X 1 k 21 a 1 /R 2 = Z = (2-469+ ll-299i) 10“ 3 ; 

and a similar calculation gives 

A z k 22 ocJR 2 = Z' = (- 2-469 -9-869i) 10- 3 . 

These results fix the positions of the initial points in the displacement 
diagrams, and the next stroke can now be followed out in the usual 
manner. The velocity throughout the stroke is found to be negative, 
but always very small, and at the conclusion of the stroke permanent 
ankylosis is readily proved to occur. 

The full history of the behaviour of the rudder is represented in 
Fig. 12-9-3. 

Comparison between Types of Motion in Examples (i), (iii) and (iv). 
A co.mparison between three types of rudder motion on aeroplane 
No. 2 at V = 230 is given in Fig. 12-9-2. Graph A shows the decreasing 
motion just described in example (iv); graph B corresponds to the 
steady unstable oscillations, with the semi-period T = 0-0994, con¬ 
sidered in example (i); lastly graph C relates to the growing oscillations 
discussed in example (iii). The initial disturbances appropriate to the 
three types of motion can for brevity be referred to as 0-75X, X , and 
1-5X respectively. Here X denotes the complete disturbance required 
to produce the steady oscillations, and is specifically 

q 1 (0)/R 2 = 0-0004865, q 2 (0)/R 2 = 0-000681, 
qi(0)/R 2 = 0-007977, q 2 (0)/R 2 = 0. 

As already remarked, the motion C cannot grow indefinitely. It tends 
ultimately to the steady stable oscillations with the semi-period 
T = 0-116. 



Part III. Aeroplane No. 3 

12*10. Aeroplane No. 3. A final example is given to show that 
solid friction can produce bounded oscillations of a dynamical system, 
even when the frictionless system is always stable. 

To derive a simple illustration we shall retain all the dynamical 
coefficients appropriate to aeroplane No. 2 (see Table 12-6-1) with the 
exception of the stiffness coefficient C 1V If the new stiffness coefficient 
is chosen to be 33,700 +1-01 V 2 + a(F 2 — 230 2 ), and a is arbitrary, then 
evidently all the dynamical coefficients for the new system—which 
we shall call aeroplane No. 3—will agree with those for No. 2 when 
V = 230. Hence at this particular speed the oscillatory characteristics 
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of the two aeroplanes, with or without friction, will be identical. In 
particular, the bounded oscillations already shown to be possible on 
aeroplane No. 2 will also be possible on aeroplane No. 3, irrespective 
of the value of a. It only remains to choose the constant a in such a 
way as to render aeroplane No. 3 stable at all airspeeds. A value for a 
which renders all the coefficients of the determinantal equation and 
the test function T 3 positive at all speeds is 0*6. The approximate values 
of the roots of A(A) = 0 for a few representative airspeeds are tabulated 
below. 

Table 12-10*1 

Roots of Determinantal Equation for Aeroplane No. 3 


V 


U) 

m' 

a/ 

0 

0 

6-76 

0 

0 

100 

-0-59 

12-40 

-3*59 

6*70 

230 

-0-36 

25-47 

-9*25 

14-92 

350 

-0-32 

38-16 

-14-30 

22-51 

600 

-0-33 

54-21 

-20-55 

32-01 


It will be noted that, although the aeroplane is completely free from 
flutter, yet it is only slightly stable at all speeds. Naturally no actual 
aeroplane would have such characteristics, but it is quite possible for 
an aeroplane which is immune from flutter to approach very close to 
flutter at particular flying speeds (see Fig. 12-1-3). From the examples 
already worked out it seems clear that at such speeds the development 
of much friction in the controls might sometimes result in the occurrence 
of bounded oscillations. 




CHAPTER XIII 


PITCHING OSCILLATIONS OF A FRICTIONALLY 
CONSTRAINED AEROFOIL 


13*1. Preliminary Remarks. The theory given in Chapters xi 
and xii indicates that with particular aerodynamical systems possessing 
solid friction, steady oscillations can occur at airspeeds less than the 
critical speed for flutter. If the system has two degrees of freedom 
represented by the generalised coordinates q l9 q 2 , and if q 2 only is 
subject to frictional constraint, then stable steady oscillations are 
possible below the critical speed provided that (see §§ 11*7, 11*8, and 
equations (11-3-13), (11-4-1)) n m 


£+i7} = A 1 k 21 y 1 = 


i r • > t 3Ai 1 3Ai /ft . 

when, .(2) 

and A x denotes the critical root.* If A x = io) at the critical speed, and 
if 8/i is the increment of the real part of the root when the speed is 
slightly increased from the critical value, then the criterion (1) can be 
expressed as 2 


This inequality has a simple physical significance. It requires that 
instability of the frictionless system shall develop if, while the airspeed 
is maintained at its critical value, the direct damping coefficient 
appropriate to the coordinate q 2 is given a small positive increment. 

Steady oscillations of the type just referred to have been demon¬ 
strated in a wind tunnel! at the National Physical Laboratory. The 
system chosen for these experiments consisted of a single rigid aerofoil 
capable of pitching motion about two separate axes, and it was 
designed entirely from theoretical considerations. Part I of the present 
Chapter deals with some problems connected with the design of the 
apparatus, and Part II gives a brief description of the experiments. 


* For the definition of critical root see § 12-1. 


t Ref. 43. 
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Part I. The Test System and its Design 

13 # 2. Description of the Aerofoil System. The system as finally 
constructed is shown in Figs. 13-2-1, 13-2*2, 13-2*3. The two vertical 
axes about which pitching movements are allowed are A A and BB: 
these will be referred to as the “ frame axis’ * and the “aerofoil axis”, 




Fig. 13-2-3 


respectively. The rigid frame A ABB consists of a steel rod A A of dia¬ 
meter 0-5 inch, two arms of square channel section K y and a steel rod BB 
of diameter 0-25 inch. The frame is pivoted about AA as an axis. At 
the bottom is a point and cup support F , with a locking point O carried 
on a small bridge; the top support is a simple journal bearing H . The 
rod BB carries a pair of ball-bearings which support a rigid wooden 
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aerofoil of symmetrical section. Screwed to each end of the aerofoil 
is a metal plate E, and attached to each plate by metal screws is a 
circular steel disc C in the centre of which is the spherical ball-race. 
The attachment between C and E is such that it is possible to adjust 
the position of the aerofoil with respect to the axis BB; for this purpose 
the hole in the aerofoil, through which the rod BB passes, is of elongated 
section (Pig. 13-2*3). 

Bearing against the rims of the two discs C are two steel levers D 
(Fig. 13-2-2). These are operated by threads J which pass through small 
holes in the supports F, H and on the axis AA, and so to the outside 
of the wind tunnel. Small release springs L are also attached to the 
levers D. The arrangement is such that the motion of the aerofoil with 
respect to the frame can be constrained by solid friction applied 
externally; but any tension in the threads does not affect the motion 
of the frame relative to earth. 

The actual aerofoil used was made of yellow pine, and had a span of 
nearly 4 feet, a chord of 6 inches, and a maximum thickness of 0-75 
inch: the profile is shown in Fig. 13-2-3. The distance between the axes 
AA, BB was chosen to be 8 inches, and the axis BB was at 0-9 inch 
(i.e. 0-15 chord) behind the leading edge of the aerofoil. Stability for 
the frame was provided by attaching springs in pairs to the arms K, 
which were provided with hooks at various radii. These arms were also 
extended backwards so that, if necessary, the frame could be cross- 
braced and given additional inertia. 

For the measurement of small or moderately large angular displace¬ 
ments a small mirror was fixed to the shaft A A and used in conjunction 
with a spotlight and scale. Very large angular displacements were read 
directly on a quadrant. A second mirror attached to the aerofoil near 
the shaft BB was used in experiments with the frame locked. 

13-3. Data Relating to the Design of the Test System. The 

preceding description applies to the test system as constructed, but 
the final design depended on much preliminary theoretical work. The 
aerofoil itself, and the position of the aerofoil axis BB at 0-9 inch behind 
the leading edge, were treated as definitely assigned from the outset. 
However, certain of the inertias and spring stiffnesses, and the position 
of the frame axis AA, were left free to be varied. The calculations 
relating to the aerodynamical derivatives and to the inertias will now 
be summarised. 
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(a) The Aerodynamical Derivatives. The derivatives appropriate 
to coupled pitching and normal oscillations are deduced from the 
theory of the accelerated motion of an aerofoil in Ref. 44. The 
generalised coordinates corresponding to these two degrees of freedom 
are 6 (inclination in pitch) and z (linear displacement of the pitching 
axis normal to the chord). The senses of these coordinates are indicated 
in Fig. 13*3*1. 

The derivatives given by the theory depend through a parameter 
on the frequency of oscillation. We shall see later that in relation 
to the application in view this frequency parameter, which is defined 
as 0 )cjV (where w/27t is the frequency, c the chord, and V the airspeed), 
can be chosen at convenience: the value ojc/V = 0*4 will be adopted. 



If Z denotes the normal force and M the pitching moment about the 
aerofoil axis BB (prescribed at 0*15 chord behind the leading edge), 
the appropriate derivatives are as follows:* 


Damping Matrix. 93 = 


\-z* 

-Zs ] 

= 10 _3 F|"9-34 

1*40 " 


-Mi. 

0-467 

0*470. 


Aerodynamic Stiffness Matrix. 


\-Z. 

-z 9 i= io- s F 2 ro 

9*34 ' 

L -M. 

-M e \ [o 

0*467. 


For the investigation of the effects due to friction it is convenient 
to adopt for the generalised coordinates the angle of rotation q x 
about AA of the frame relative to earth and the angle of rotation 
q 2 about BB of the aerofoil relative to the frame (see Fig. 13*3*1). 
If l denotes the distance AB> these coordinates are connected with 
z, 6 by the linear substitution 

n. ® 

* Since the aerofoil spanned the wind tunnel no corrections for aspect ratio were 
attempted. However, a slope of 2-7 was assumed for the lift coefficient curve instead of the 
usual theoretical value tt. 
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or say {z,6} = uq. The aerodynamical damping and stiffness matrices 
B, W appropriate to the coordinates q x , q 2 are then given by (compare 
example to § 9-3) 


B = u'fSu = 

i-H 

1 

kr 

? 

o 

r—t 

1 r9-34 1-40 

]\~ l °1 


Lo l 

J [0-467 0-470 

111 ij 

W = m'SBm = 

> io- 3 F 2 r-z 

lire 9-34 ir 

-l 01. 


Lo 

lj LO 0-467J [ 

1 ij 


In particular, if l = f foot (as in Fig. 13-2-1), these yield 


'3-38 

-0-46' 

and W = 10- 3 F 2 r-5-76 

-5-76 1 

0-16 

0-47. 

0-467 

0-467J 



Fig. 13-3-2 


(b) The Inertias. The contour adopted for the aerofoil is a blunt- 
nosed quartic oval.* The half-thickness t at a distance £c behind the 
leading edge is „ „ 

-T 



where # is the maximum thickness-chord ratio. Hence if 8 is the 
span, y the material density, and K = f#sc 4 y(4)^, the inertias are given 
by (see Fig. 13-3-2) 

P = A lt = A n = Kfei(l-£)(h-g)(k + h-£)d£, 

in which he, kc are respectively the distance between the leading edge 
and A A, and the distance between A A and BB. 


* Defined in Ref. 46. 
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For the test system as finally constructed, 3 = 4 ft., c = 0*5 ft., 
# = 0*125, y = 0*75 slug/ft. 3 , h = 0*15, and k = 1*3333. The numerical 
values of the inertias for the aerofoil alone are then* 

A n = 19*1 x 10~ 3 slug ft. 2 , 

P = A 12 = A 21 = - 3*3 x 10- 3 slug ft. 2 , 

A 22 = 1-8 x lO -3 slug ft. 2 

For the complete system, the moment of inertia of the frame about 
the axis A A must be added to the value of A n appropriate to the 
aerofoil only, while the values of P and A 22 are also modified by the 
metal attachments to the aerofoil. Any addition of mass to the frame 
will increase A lx without altering P or A 22 . Hence A n is a separately 
variable parameter. 

13-4. Graphical Interpretation of the Criterion for Steady 
Oscillations. In connection with the design of the apparatus, two 
methods were developed for the discussion of the criterion (13*1*3). 
The first, a graphical method, is based on the use of a test conic,\ and 
it will be illustrated with reference to the following particular binary 
system. 

Table 13*4*1 


Dynamical Coefficients for Illustrative Binary System No. 1 


Coefficient 

Value x 10 3 

Coefficient 

Value x 10® 

^11 

350 

A U {==P) 

41 

Sn 

3-74F 

B S1 

0-703 F 

C„ 

6-14F 2 + 10 3 (7 1 

C'ji 

0-467 F* 

4(sP) 

41 


2-3 

B lt 

1-17F 

B n 

(0-47 +a) F 

c lt 

5-14 F 2 

c„ 

0-467 F 2 + lO 8 ^, 


These dynamical constants were estimated for an aerofoil system 
identical with that described in § 13*2, except that the frame axis AA 
was situated 6 inches forward of the aerofoil axis BB . The coordinates 
q x and q 2 are as defined in § 13*3 and the frietionally constrained co¬ 
ordinate is assumed to be q 2 . The spring stiffnesses <r l9 cr 2 are regarded 
as parameters temporarily left free for choice, while a in Table 13*4*1 
is a parameter ultimately to be made zero. 

* The slug is the unit of mass commonly adopted in aeronautics. A force of one pound 
applied to a mass of one slug produces an acceleration of one foot per second per second. 

t For a fuller description of the use of test conics in relation to flutter problems the 
reader should consult Chap, m of Ref. 30. 
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If we write C u x 10 s = 5-14F 2 +10 s (r 1 sXF 2 , .(1) 

C* 22 x 10 3 = 0-467F 2 + 10% = 7F 2 , .(2) 

the coefficients of the determinantal equation are 
10% = 63-69, 

10%/F = 17-3727+ 35-0a, 


10%/F 2 = 2-3.Z +35-07+3-74a—22-05341, 
10%/F 3 = (0-47 +a)X + 3-747 — 4-16981, 
10%/F 4 = 17-2-40038. 



The test function T a for stability (see (9-8-1)) is given by 
T 3 = PiPiPa-poPs-PiPv 

and when a = 0 the vanishing of T s defines the following quadratic 
relation between X and Y: 

0 = X 2 -9-60548(2X7) + 293-629 7 2 

- 10*3213(2X) - 210*211(27) + 258*154. 

If X, Y are regarded as rectangular coordinates in a plane, this equation 
represents a test conic , which is shown as conic No. 1 in Fig. 13*4*1. 
Elimination of V 2 between (1) and (2) gives the stiffness line 

<r x ( Y -0*467) = cr 2 (X-5*14), 

which always passes through the stiffness point Z = (5*14,0*467), and 
has the positive slope <r 2 l<r v It is seen from the diagram that Z happens 






13 - 4 - 13-5 


THE SECOND CONIC 


389 


to lie very nearly on the conic. The horizontal stiffness line (o* 2 = 0) is 
shown as ZB. When <r v or 2 are known, the intersection of the corre¬ 
sponding stiffness line with the conic fixes the critical speed for the 
system. 

The second curve (conic No. 2) shown in Fig. 13-4*1 represents the 

dT 

= — 0, which reduces to 

VX>;22 

0 = X 2 - 5*79196(2X Y) + 255*613 Y 2 

- 14*5159(2X) - 215*904(27) + 326*922. 

It is easy to see that points lying inside curve No. 1 correspond to 
T 3 <0 and hence to instability, and that points inside curve No. 2 
satisfy the condition dTJdB 22 < 0. Now consider any point on curve 
No. 1 which lies within curve No. 2. For such a point T z = 0 and 
dTJdB 22 < 0, so that an increment in B 22 leads to a negative value of 
T Zi i.e. to instability. Hence dfijdB 22 >0, whichisthe condition (13*1*3). 

There are two separate arcs of curve No. 1, namely ABO and DEF , 
which lie inside curve No. 2 and which thus satisfy the foregoing 
condition. The slope of the stiffness line must, however, be positive. 
Hence only those portions of the two arcs for which X > 5-14 and 
Y > 0*47 yield permissible values of the stiffnesses. The horizontal line 
ZB excludes the portion AB of ABC , while a very small portion of 
DEF is excluded by the vertical line ZO. Stable steady oscillations will 
therefore occur at speeds below the critical when friction is present, 
provided the stiffnesses <r l9 (X 2 are such that the stiffness line lies within 
either of the angles BZC or DZO. 

A system of the type discussed, with the frame axis AA forward of 
the aerofoil, was found to be open to certain practical objections and 
was not put into construction. 


condition 


ition(S) 


13-5. Alternative Treatment Based on the Use of Inertias 
as Parameters. The spring stiffness cr 2 is here assumed to be zero, 
and the stiffness <t x (and therefore C n ) and the inertias are left free for 
choice. The coefficients of the determinantal equation are 


Po = A u A 22 -P 2 , 

Pi ~ A ii -B22 + A 22 B X1 — P(B 12 + -B 2 i)> 

p 2 = ^11^22 + ^22^11 ~~ B(Ci2 + ^2l) 4- Bn B 2 2 ” *®2i> “ 

Ps = -® 22 ^ X 1 + -® 11^22 -® 12^21 “ -® 21 ^ 12 > 

P\ = C n C 22 — C 12 C 21 . j 


(1) 
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At a critical speed, = i(o, and 

p 0 u l -P2<*> 2 +Pi = 0 , .( 2 ) 

p x o)*-p z = 0. .(3) 


On substitution of the expressions (1) for the coefficients in (2) and (3) 
two equations are obtained from which C xx can be eliminated. The result 
of this elimination, which is found to be independent of A xx , is 

0 = ( d *{ P 2 B 22 — pA 22 {B X2 4 - P 2 i) 4 - A\ 2 B lx } 

+ (o 2 [P{C 22 (B 12 -f B 21 ) — B 22 (C X2 + ^ 21 )} 

+ ^22(^12^21 + ^21^12 ~ 2 P 11 C f 22 }] 

-f {o) 2 B 22 (B xx B 22 - P 12 B 21 ) + B 22 C 12 C 2 1 

— ^22(^12^21 + B 21 C 12 ) 4 - ^11^22}.(^) 

Since in this equation the four damping coefficients B^ and the three 
stiffness coefficients are proportional to V and V 2 respectively, the 
equation effectively involves only the unknowns P, A 22 , and the ratio 
(o/V. If the frequency parameter ojc/V, and therefore w/V, is arbitrarily 
assigned, a relation between P and A 22 results. 

Again, the formula (12*3*1) gives 

(i) 

and in the critical case, if A x = io), this can be reduced to 

g+irj = (<7 n -cj 2 A 11 + i<oB 11 )l2(<o 2 p 1 + iG)2G) 2 p 0 --p 2 ). ...(5) 

The numerator in (5) can be expressed in a form independent of A 1V 
for on substitution from (1) in (3) we obtain 

^22(^11 n) = < j ) 2 ( B 1 x A 22 — P B 12 4* B 2l ) — B n C 22 4 * B X2 C 2X + -®2i^i2* 
It follows that the expression (5) can be written 

E+iv = 

{&A XX 4 - 6 ) 4 - i{cA xx 4 - d) 9 


where a, /?, a, 6, c, and d are all independent of A xx and C X1 . The condition 

(13-M) requires that . . „ v //>f 

^ AiJotc — Ba) > (8b — ad). . (61 


The procedure is to assign at convenience the critical frequency 
parameter o>c/7, to choose a series of values of P, and to determine a 
corresponding value of A 22 from (4). These values of P and A 22 are 
then used to determine a, /?, a, 6, c, and d. Any value of A xx which 
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satisfies (6) and which exceeds P 2 /A 22 can then be assumed, and the 
system so defined will exhibit stable steady oscillations below the 
critical speed. 

For example, suppose the dynamical coefficients to be as given 
in Table 13-51. 

Table 13-5-1 


Dynamical Coefficients for Illustrative Binary System No. 2 


Coefficient 

Value x 10 3 

Coefficient 

Value x 10 3 

A n 

Bn 

On 

A t A=P) 

Bn 

On 

A n x 10 s 

3-38 F 

- 5*76 F 2 -f 10 s 

P x 10 3 
-0-46F 
- 5-76F 2 

A tl (=P) 

Bn 

C tl 

A n 

Bn 

On 

P x 10® 

0-16F 

0-467 F 2 

A ti x 10 3 

0-47 F 

0-467 F a 


The aerodynamical derivatives are here appropriate to the actual 
test system described in § 13*2. Since for that system c = 0-5 and the 
frequency parameter wc/V is assumed to be 0-4, we have oj = 0-8F. 

If C n x 10 s = — 5-76F 8 + l^sIP, .(7) 

the formulae (1) yield 

Po ~ ^11^22 F 2 , 

lO^pJV = 0*47^ n + 3-38^ 22 + 0-30P, 

10 3 p 2 /F 2 = 0A%1A X1 + A 22 X + 5-293P + 0-0016622, 

10 3 p 3 /F 3 = 0*47 X+ 2-71488, 

10 6 p 4 /F 4 = 0-467X + 2-68992, 
while equations (2) and (3) become 

0-4096p 0 F 4 -0-64p 2 F 2 +p 4 = 0, 

0-64p 1 F 2 — p z = 0. 

Equation (4), which is derived by substitution for the coefficients p 
and elimination of X, now becomes 

0 = 409600P 2 4-261447^[ 22 P + 2945634^L 2 2 

-f 3196-745P- 5846-25L4 22 + 1-07144. 

The values of ^4 2 2 corresponding to a range of values of P, and the 
appropriate minimum values of A 1X as calculated from equation (6), 
are given in Tables 13-5-2 and 13-5-3. In the case P = 0, only the 
greater value of A 22 is used. 





392 


PREDICTED BEHAVIOUR OF THE SYSTEM 13*5-13*6 


Table 13-5-2 


P x 10 s 
(slug ft. 2 ) 

A ti x I0 8 (slug ft. 2 ) 

First value 

Second value 


1-780419 

0-204299 


2-636547 

Negative 

m 

2-936515 

M 

-6-0 

2-909695 

»» 


Table 13-5-3 


Px 10 s 

0 

-2-0 

-4-0 

-6-0 

ax F 

2-41813 

5-54148 



fix V 

2-704 

2-704 



a 

0-6016 

0-6016 



b x 10* 

7-70280 

10-63875 

11-16854 

10-28450 

c 

1-07594 

1-95262 

2-25979 

2-23233 

d x 10* 

-9-54798 

-17-29053 

- 30*23657 

-49-61866 

(Pb-ad)Vx 10* 

43-917 

124-582 

214-796 

284-071 

(a c-pa)V 

0-9570 

9-1937 

12-1695 

9*9024 

Minimum A u x 10 8 

45-041 

13-551 

17-650 

28-687 


Prom the results it is seen that quite a wide range of the inertial 
constants is permissible. A rough estimate of the inertias for the 
wooden aerofoil and some of its attachments indicated that the value 
P = — 4 x 10~ 3 , and the corresponding value of A 22 , would be realisable 
in practice. The actual value of A n was expected to exceed greatly the 
minimum required by Table 13-5*3, and a round figure of 40 x 10“ 3 
was adopted for the further discussion of the test system. 

13*6. Theoretical Behaviour of the Test System. In accord¬ 
ance with the conclusions drawn in § 13-5 the dynamical coefficients 
were chosen as follows: 

Table 13-6-1 

Dynamical Coefficients for Theoretical Test System 


Coefficient 

Value x 10 s 

Coefficient 

Value x 10 s 

B U 

ell 

A lt (=P) 

c" 

° 1 * 

40-0 

3-38F 

10*<7!-5-76 F* 

-4-0 
- 0-46 F 
-5-76F 2 

A n [=P) 

c n 

A 1 

t 

C « 

-4-0 

0-16F 

0-467 F 2 
2-936515 

0-47 F 

0-467 F 2 


(i) Behaviour at a Critical Speed . The critical value of X (see 
(13-5-7)) leading to simply sinusoidal oscillations of the frictionless 
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system is readily found to be X c = 31-7051, and the critical speed V e 
is then fixed in terms of <r t by the relation 

lOVi = 37-4651F* 

The coefficients of the determinantal equation for the same condition 
work out as ^ = 101-4606 x 10-«, 

p x = 27-52541; x 10-«, 
p 2 = 92-2727F£ x 10~®, 
p a = 17-6163F®x 10-*, 

Pi = 17-4962Fc x 10~ 6 . 

Further, the real and imaginary parts of the roots ^ and of the con¬ 
stants \h 2r y r are given by 

p = 0, o) = 0-81;, 

p' = -0-1356461;, o' = 0-5010411;, 

VJ = 0-077728 x 10 3 , V e y = - 0-055964 x 10 3 , 

F c £' = - 0-077728 x 10 3 , V e ^ = 0-603823 x 10 s . 

Lastly, the equation which determines the semi-period T is 

0 — iD.(T) — ^ — V s * n T (,> sinh Tp' — i)' sin To' 

® — cosh Tp + cos T<o cosh Tp' -I- cos Tcj' 

It is found that D(T) vanishes when TV C = 3-6563, and that the graph 
of D(T) has an asymptote at TV C = 3-9270. The first value of TV C 
corresponds to an unstable steady oscillation which, if disturbed, either 
dies to ankylosis or grows indefinitely. 

(ii) Behaviour at a Speed Less than the Critical. If we next assume 
the airspeed to be reduced to V = 0-9751;, the coefficients of the 
determinantal equation become 

p 0 = 101-4606 x 10~«, 
p 1 = 26-8373F c xl0-«, 
p 2 = 93-1488F 3 x 10- 6 , 
p 3 = 17-1755F?x 10-«, 
p 4 = 16-6323FJ x 10-«, 

while p — — 0-0088951;, (o = 0-8175391;, 

p' = - 0-1233601;, o' = 0-4796051;, 

V 0 g = 0-052411 x 10 3 , V C T) = - 0-051047 x 10 3 , 

VJ' = -0-052411 x 10 3 , V c i/' = 0-510530 x 10 3 . 
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In this case the lowest non-zero roots of the semi-period equation are 
given by TV C = 3*6692 and TV C = 3*7714. The first of these corresponds 
to the unstable oscillations referred to in the critical case, and the 
second (which develops from the asymptote) corresponds to stable 
steady oscillations. The characteristics of this stable motion will now 
be examined. 

With TV C = 3*7714, it is found that 

TX 1 

6 '~ - = -14-826 + 25-764i, 

e TA i +1 

e TA «—1 

-- = -0-65144+ l-lil35i. 

e 2 A » +1 

Moreover V^k^ = (- 0-063130 - 0-06342H) 10 3 , 

F 2 fc 23 y 3 = (1-024789-0-154307i) 10 3 , 

Ai*2i7x = (0-041267 + 0-043302i) 10 3 , 

A§fc 23 y 3 = (-0-238388-0-088116i) 10 s . 
while a 22 = -d u /p 0 = 0-394242 x 10 3 . 

Hence (see (11-6-6) and (11-6-6)) 

4 pTA r _1 

q 2 (0)/R = S^ 2r 7r^n 

= 4-3627 x 10 3 /F 3 , 

4 pTAr-. 1 

q 2 (0)/R = *k ar y r -—+a n 
= -2-6019 x 10 3 . 

These expressions confirm that the oscillations are possible. Again, 
using equation (11*4*1), we find that 

V*k u y x = (-0*038914-0*021948i)10 3 , 

V 2 c k xz y z = (0*203520 — 0*014215t) 10 3 , 
and the motion of the frame is then given by 

».«)/*--r^)- 

A graphical examination shows that the coordinate q x lags in phase 
by 54° behind q 2i and that its maximum value is 235022/F 3 . 

If the critical speed is chosen to be 40 feet per second (corresponding 
to a spring stiffness a x of 59*94 pounds feet per radian), the results 
obtained under the present heading are applicable at a speed of 39 feet 
per second. The amplitude of steady oscillations for the aerofoil then 
is 2*72 radians per foot pound of frictional couple, while that for the 
frame is 1*47. 
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Part II. Experimental Investigation 

13-7. Preliminary Calibrations of the Actual Test System. 

The system as described in § 13-2 was mounted vertically for test in a 
wind tunnel. The following preliminary calibrations of the apparatus 
were made: 

(i) Frictional Couple. The frictional couple applied to the aerofoil 
by the discs C and D (see Figs. 13*2*1 and 13*2*2) was varied in the 
wind tunnel experiments by the addition of masses to the scale-pan 
M outside the tunnel. A direct calibration was made of the couples 
required to overcome the 4 'stick friction” and the “slip friction” due 
to a given mass in M : the frame was locked centrally in these tests. 
In the measurements of slip friction the aerofoil was given a small 
angular velocity, and the couple required to keep it just moving was 
determined. The slip friction was about 60 per cent of the stick friction. 

The frictional couple appeared to be directly proportional to the 
outside load in the scale-pan, so that the weight of the latter balanced 
the tensions in the release springs L to within the limits of observation. 
The final result gave a slip frictional couple on the aerofoil of 0*0167 
pound foot for 1 pound of outside load. 

(ii) Constants of Inertia. These were determined by preliminary 
oscillation tests of the apparatus under gravity, with the axes A A and 
BB horizontal. The measured constants (in slug ft. 2 ) were 

A 22 = 2*3 x 10- 3 , P = - 3*0 x 10- 3 , and A n = 41*1 x 10~ 3 . 

(iii) Spring Stiffnesses. To provide stability four springs were 
attached to the frame, one pair to the top arm K and one pair to the 
bottom arm. A direct calibration with the four springs in place gave 
a stiffness cr 1 of 61*6 pounds feet per radian. 

13-8. Observations of Frictional Oscillations. The observed 
critical speed, as indicated by the occurrence of definitely growing 
oscillations, was about 37 feet per second.* At speeds ranging from 
V = 31 to V = 36 steady oscillations were observed, the amplitudes of 
which could be increased by an increase of the frictional moment; and 
the frequency parameter of these oscillations had a constant value of 
0*41. To this extent the experimental results accorded satisfactorily 
with the theory. 

* If the measured values of the inertial constants and of a x are substituted for the values 
given in Table 13-6*1, the calculated critical speed is almost exactly 37 feet per second. 
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The measured amplitudes of the frame during the steady oscillations 
are summarised in Table 13-8-1 and Fig. 13-8*1. For a given wind speed 
they should, according to the theory, increase linearly with the fric¬ 
tional moment, but this is only borne out by the experimental results 
for a restricted range of the friction. Within this range*, the measured 
increase of amplitude for an additional frictional moment of 0*01 
pound foot is roughly 0-5 degree at V = 35. No exact comparison with 
theory is possible here, but it may be noted that for the system dis¬ 
cussed in § 13*6 the calculated increase of amplitude is 0*84 degree at 



a wind speed of 1 foot per second less than the critical speed. The 
behaviour of the actual Bystem differed from that of the theoretical 
system in that its oscillations developed spontaneously from the small 
natural disturbances in the wind tunnel: the theory requires the initial 
disturbance to be at least comparable with the final motion. 

Among the possible causes which might well account for the differ¬ 
ences of behaviour just mentioned, reference may be made to the 
following: 

(i) Departures of the actual aerodynamical forces from linear laws.f 

* Beyond this range ankylosis occurred, as indicated by “A” in Table 13*8* 1 and by 
dotted lines in Fig. 13*8-1. 

t The non-linearity of these forces was to some extent confirmed by measurements of 
the aerodynamical stiffnesses for a static condition of the wing. 
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(ii) The presence of some friction in the ball-bearings, and increase 
of this internal friction with the amplitude of oscillation. 

(iii) Departures from constancy of the externally applied friction 
during each stroke of the aerofoil. 

A very close correlation between theory and experiment was, in 
fact, found impossible owing to the susceptibility of the actual test 
system to three other types of oscillation which were almost certainly 
attributable to the departures of the aerodynamical forces from linear 
laws. A brief description of these oscillations is given in § 13*9. 

Table 13* 8-1 


Amplitudes of Frame during Steady Frictional Oscillations 


V 

Frictional Couple Applied (lb. ft.) 


0 

5-26 x 10~ 8 

10-5 x 10“ 8 

15-75 x 10- 8 

21 x10"* 

26-25 x 10- 8 

31 

01° 

0-4° 

A 

_ 

_ 

_ 

32 

0-25° 

0-55° 

A 

— 

— 

— 

33 

0-65° 

0-85° 

1-1° 

A 

— 

— 

34 

0-85° 

1*1° 

1-25° 

1-45° 

A 

— 

35 

0-95° 

12° 

1-5° 

1-9° 

215° 

2-2° 

36 

B 

— 

— 

— 

— 

— 


Remarks on Table 13*81. “A” indicates ankylosis. The method 
adopted was to increase the friction slowly, so that the amplitude could 
build up slowly to the steady value. If the friction was suddenly 
applied, the existing oscillation did not represent, in general, a suffi¬ 
cient disturbance to give rise to the new and larger oscillation; con¬ 
sequently the motion would die and the friction would then ankylose 
the aerofoil. Owing, presumably, to aerodynamical non-linearities, it 
was in practice impossible to build up the amplitude indefinitely, even 
slowly; and ankylosis would occur as indicated in the table. 

“B” indicates the most curious of all the oscillations obtained. At 
V = 36 and with zero frictional load, the oscillations grew to an 
amplitude of about 2*5°. Then, quite abruptly, the frequency dropped, 
and the amplitude immediately started to decrease. When it reached 
about 1°, the oscillation reverted to the frictional type, and the 
amplitude again increased, and so on. The result was a “hunting” 
oscillation which grew on one frequency and died on another. The 
decreasing oscillation was recognised to correspond to a sustained 
oscillation discussed in § 13*9, and there referred to as oscillation No. 1. 
Its existence was attributed to aerodynamical non-linearities. 
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13*9. Other Oscillations Exhibited by the Test System. 

(i) Oscillations with the Frame Locked . Two types of sustained 
oscillation of the aerofoil were observed when the frame was locked 
in the central position. They will be referred to as oscillations Nos. 1 
and 2. No. 1 was a smooth oscillation of half angle about 5 degrees, and 
characterised by a low value of the frequency parameter. No. 2 was 
a much larger oscillation, the half angle being about 20 degrees, and 
it was of a very jerky nature and profusely embroidered with overtones; 
the frequency parameter was appreciably higher than that of No. 1. 
To generate No. 2 it was always necessary to give the aerofoil a very 
large displacement (50 degrees or higher) and to allow it to swing down 
to the sustained state: the stability of the oscillation thus appeared to 
depend on the establishment of a turbulent oscillatory wake. 

At low wind speeds, the equilibrium configuration was spontan¬ 
eously unstable (i.e. growing oscillations would result from the natural 
disturbances present in the wind tunnel), and the oscillations increased 
to No. 1. Following a large initial disturbance, the motion decreased 
to No. 1, although there was a tendency for No. 2 to appear. Between 
V — 10 and F = 24 both oscillations were realisable; between F = 24 
andF = 30 only No. 2 occurred. FromF = 30 upward, all disturbances 
gave rise to decreasing oscillations, the equilibrium position being 
ultimately reached. The effect of friction was in all cases to reduce the 
amplitude, and ultimately, as the friction was increased, to cause 
ankylosis. 

(ii) Oscillations with the Frame Constrained by Springs . In this 
condition it was possible to obtain three different types of maintained 
motion other than the frictional type. UptoF = 25 oscillations similar 
to Nos. 1 and 2 were again realisable. Oscillation No. 3 occurred at 
speeds above about 33 feet per second, and was very violent. Its 
amplitude was roughly 60°, and on several occasions it developed so 
suddenly that it broke the safety grab of the apparatus. Between 
F = 30 and F = 37 it could be produced by releasing the aerofoil from 
a large displacement. At F = 37, if the oscillation corresponding to 
the internal friction only was allowed to become sustained and if 
external friction was then applied impulsively, the motion jumped 
almost instantly to oscillation No. 3. At a speed just above 37 feet per 
second the spontaneous oscillations rose directly to No. 3. This was 
the critical speed for flutter. 
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357, 372-380 
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270 
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computed by use of subproducts, 11-12 
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Control surfaces of aeroplanes, mass-balance 
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Convergence, 
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angular, 250, 255-256, 259, 267 
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generalised, 261 
ignorable, 272-274, 282-283 
normal or principal, 295-298 
Couplings, 284 

Crankshaft, torsional oscillations of, 316-318 
Critical 

frequency, 307 

parameters, for dynamical systems, 291 
root of determinantal equation, 359, 382 
speed for flutter, 291, 302, 358, 359, 389 
Cross 

dampings, 284 
stiffnesses, 281, 284 
Cross-multiplication, 151, 163 
Cyclic coordinates, see Ignorable coordinates 

Damping 

coefficients, 284, 339 
diagram for tail oscillations, 359, 361 
matrix, 284, 288, 385 
Dampings, direct and cross, 284 
D-determinants, 
defined, 157 

of equivalent systems of differential 
equations, 158 

Decaying oscillations, 333, 334, 359, 361 
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full, 67, 86 

multiple, 18, see also Multiply degenerate 
matrices 

of a square matrix, 18 
simple, 18, see also Simply degenerate 
matrices 

Sylvester’s law of, 23, 24 
Degrees of freedom, 259, 261, 273 
Derivatives, 
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aerodynamical, 264, 285, 286 
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385, 386, 391 

generalised aerodynamical, 284 
of matrices, 43-44 
Derived adjoint matrices, 
for simple first-order system, 205 
of characteristic matrix, 73-77 
of lambda-matrix, 61, 62, 64, 166 
Determinantal equation, 
critical roots of, 359 
defined, 61, 157 

for first-order system of differential 
equations, 205 

for linear dynamical equations, 288 
general theorems on roots of, 61, 166 
multiple roots of, 61-64, 166-181, 184, 
198, 199, 206, 288, 292-296, 296, 298 
nature of roots of, for linear conservative 
system, 291-295 
notation for roots of, 61 
Determinants, 
cofactors of elements of, 16 
computation of, 106-108, 110, 116, 120 
differentiation of, 16 
minor, 16 

of square matrices, 16 
symmetrical, 34 
test, 154, 291 

whose minors of a certain order all 
vanish, 17 

Deviation, equations of, 280 
Diagonal, principal, 3 
Diagonal 

matrix, 3, 12-13, 23, 66, 89, 91, 93 
order, 161 

systems of linear differential equations, 
160 

Difference equation, linear, 73, 148 
Differential equations, linear, see Linear 
ordinary differential equations. 
Linear partial differential equations 
Differential operators, 
change of variables in, 48-52 
matrices of, 46-51, 156 
transformation of quadratic, 51-52 
Differentiation, 
notation for, 44 

of bilinear or quadratic forms, 48 
of determinants, 16 
of matrices, 43-62 

of matrix exponential function, 45-46 
Direct 

dampings, 284 
elastic stiffnesses, 281 
solution of linear ordinary differential 
equations, 191-195, 201, 207-209, 
295 

stiffnesses, 284 

Direction cosines, matrix of, 34, 247, 251 
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Discriminants, 
defined, 30 

of kinetic energy function, 272 
of quadratic form, 30-33 
Displacement diagram, for graphical dis¬ 
cussion of friction problems, 355- 
367, 373-374 
Distributive law, 10 
Disturbances, 

effects of arbitrary initial, 334 
effects of large initial, 334, 361, 398 
Disturbed 

equilibrium of conservative system, 281 
motions, 280-281 

steady motion of aeroplane, 284-287 
steady motion of conservative system, 
282-283 
Divergence, 290 
Division, 

of lambda-matrices, 58-60 
of matrices, 22 

on the left and on the right, 60 
Divisors, elementary, 92 
D-matrix, 167, 159 
Dominant latent roots, 
calculation of, 134-143 
defined, 80 
Down-stroke, 335 
Dynamical equations, 
for systems with solid friction, 336-345 
Hamilton’s, 274-276, 289 
Lagrange’s, 269-272, 277-279 
linear, see Linear dynamical equations 
Dynamical matrix, 
canonical form of, 298, 310 
constructed by use of flexibility matrix, 
309 

defined, 308 
modal matrix of, 298 
Dynamical systems, 

conservative, *ee Conservative system 
constituentttnotions of linear, 289-291 
defined, 266 

forced oscillations of, 302-307 
geometrical equations of, 262 
holonomous, 260, 261, 269-272 
non-holonomous, 260 
non-linear, 334 
reciprocal property of, 276 
with ignorable coordinates, 272-274, 
282-283 

with solid friction, see Solid friction 

Elastic stiffnesses, 264, 281 
Element, of matrix, 1 
Elementary 
divisors, 92 

operations on matrices, 87-89, 90, 96, 
97 


Energy, 

equation of, 276 
kinetic, 257, 270, 272, 275, 282 
potential, 263, 264, 271, 326 
Equality of matrices, 4 
Equation, 

algebraic, of general degree, 148-155 
characteristic, 64, 67, 70, 142, 157, 205 
determinantal, see Determinantal equa¬ 
tion 

indicial, 215 

semi-period, for steady frictional oscilla¬ 
tions, 346, 348-350, 364, 370, 393 
Equations, 

dynamical, for frictional systems, 335-345 
geometrical, of a dynamical system, 262, 
277 

Hamilton’s dynamical, 274-276, 289 
Lagrange’s dynamical, 164,194,196,197, 
269-272, 277-279, 289 
linear algebraic, 27, 96-97, 125-133 
linear difference, 73, 148 
linear dynamical, see Linear dynamical 
equations 

linear ordinary differential, see Linear 
ordinary differential equations 
linear partial differential, 60-52, 227 
terminal, 342 
Equilibrium, 
conditions for, 262-265 
conservative system disturbed from, 281 
Equivalent 

lambda-matrices, 90-92 
matrices, 89 

systems of differential equations, 158, 
159, 161 

Existence theorems, 212-214 
Exponential function of square matrix, 
applied in method of mean coefficients, 232 
defined, 41-43 
differentiation of, 45-46 
solution of differential equations by, 
209-211, 221, 333 

Factors, 

invariant, 91-92 
linear, 92 

Fields of force, conservative and non¬ 
conservative, 263-265 
Finite rotations, matrices representing, 
248-249, 251-254 
Flexibilities, 265 
Flexibility matrix, 
defined, 265 

used to calculate dynamical matrix, 309 
Flexural oscillations of tapered beam, 318- 
320 

Flexure-torsion oscillations of aeroplane 
wing, 266, 305-307, 328-331 
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Fluid flow across annulus, 229-230 
Flutter, 

critical speed for, 291, 302, 307, 358, 359, 
389 

experimental observations of, 332,333,395 
influence of solid friction on, 360, 362 
practical meaning of, 359 
simple theory of, 358 
Forced motion of deviation, 280 
Forced oscillations, 
of aerodynamical systems, 302-307 
resonant, 183, 305 
Forces, 

conservative, 263-265 
generalised, 263-265, 270, 277 
Forms,, bilinear and quadratic, 28-30 
Fractional powers of matrices, 38, 81, 82 
Frame of reference, 

angular coordinates of a three-dimen¬ 
sional, 250, 255, 256 

angular momenta referred to a moving, 
257 

angular velocities of a moving, 248, 253, 
255 

defined, 246 

kinetic energy referred to a moving, 257 
Lagrange's equations with a moving, 
277-279 

moving, in two dimensions, 247-248 
velocities and accelerations referred to a 
moving, 248, 256 

Free 

disturbed steady motion of an aeroplane, 
284-287 

motion of deviation, 280 
Freedom, degrees of, 269, 261, 273 
Frequencies, 

calculated by iterative methods, 308-331 
calculated by Rayleigh’s principle, 300, 
310, 315 

conservative system with equal, 198-200, 
292-295, 309, 310, 322-325 
Frequency 

diagram, for tail oscillations, 359, 361 
parameter, for aerofoil oscillations, 385 
Friction, solid, see Solid friction 
Frobenius, method of, 215 
Fundamental solutions of linear differential 
equations, 214-215, 219, 220, 223, 
224 

Fuselage, oscillations involving torsion of, 
359-381 

Galerkin, method of, 224-231 
Generalised 

aerodynamical derivatives, 284 
coordinates, transformation of, 282 
coordinates, velocities, and accelerations, 
260-262 


forces, 263-265, 270, 277 
momenta, 274-276 

moments and products of inertia, 281, 
284 

Geometric constraints, 259-260, 261, 

270 

Geometrical equations, for a dynamical 
system, 262, 277 
Graphical treatment 

of binary frictional systems, 354-367, 
372-380 

of conditions for steady oscillations, 387- 
389 

Gyrostatic coefficients, 278, 279, 283 
Hamiltonian 

equations of motion, 274-276, 289 
function, 275 
Heaviside, 

identity of, expressed by matrices, 176 
method of, 197, 203-204 
Hermitian matrix, 33, 155 
High powers of a matrix, 80, 133-145 
Holonomous, or holonomic, systems, 
defined, 260 

generalised coordinates of, 261 
Hamilton’s equations for, 274-276 
Lagrange’s equations for, 269-272, 277- 
279 

Homogeneous 
strain, 258 

system of differential equations, 157 

Idempotent matrices, 79 
Ignorable coordinates, 
defined, 272 

disturbed steady motion of systems with, 
282-283 

dynamical equations for systems with, 
272-274 

Independent variables, change of, 48-51 
Index law for matrices, 37-38 
Indicial equation, 215 
Inertia, 

generalised moments and products of, 
281, 284 

principal axes of, 257-258 
Inertia 
force, 269 
matrix, 284, 288 

Infinite series of matrices, 40-41, 63, 81 
Influence numbers, 265 
Integration of matrices, 52-56 
Interpolation formula, Lagrange’s, 40 
Invariant factors, 91-92 
Inverse matrices, see Reciprocal, Reciproca¬ 
tion 

Irregular singularity, 215 
Isolation, method of, 197 
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Iterative methods, 

for coefficients in Sylvester’s expansion, 
138-141 

for construction of characteristic equa¬ 
tion, 142 

for frequencies, 308-331 
for latent roots, 134-138, 140-145, 148- 
151 

for linear algebraic equations, 132-133 
for linear dynamical problems, 308-331 
general remarks on, 150-151, 309-310 
modal columns calculated by, 141-145, 
150, 151, 308-310 

overtones calculated by, 309, 312-314, 
325, 330-331 

twist of aeroplane wing calculated by, 
325-327 

Jacobians, relations between, 49 

Kinematic constraints, 259-260, 261, 270 
Kinetic energy, 
discriminants of, 272, 337, 370 
expressed by generalised coordinates and 
velocities, 270, 282 

expressed by generalised momenta and 
velocities, 275 

expressed by submatrices, 272, 282 
expressed for moving axes, 257 
Kinetic potential, 271 
Kinosthenic coordinates, see Ignorable 
coordinates 

Lagrange’s equations, 
constructed, 269-271 
expanded form of, 271-272, 281 
for systems referred to moving axes, 277- 
279 

linear, direct solution of, 194, 195 
linear, expressed as first-order system, 
164, 289, 327 

linear, special solution of, 197, 198-200 
modified for ignorable coordinates, 272- 
274, 282-283 

Lagrange’s interpolation formula, 40 
Lagrangian function, 
defined, 271 

modified, for ignorable coordinates, 272- 
274, 282 

Lambda-determinant, 157 
Lambda-matrices, 
equivalent, 90-92 

multiplication and division of, 58-60 
Smith’s canonical form for, 91-92, 181 
Lambda-matrix, 
adjoint of, 61-64, 165-167 
defined, 57, 157 
degree of, 57 

derived adjoints of, 61, 62, 64, 166 
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elementary divisors of, 92 
invariant factors of, 91-92 
leading matrix coefficient of, 57 
linear factors of, 92 
rank of, 57, 91, 182 
reciprocal of, 58 

remainder theorems for, 59, 60, 70 
Laplacian operator, transformation of, 
51 

Latent roots, 

calculated by iterative methods, 134-138, 
140-145, 148-151 

classical submatrices containing, 94 
defined, 64 

diagonal matrix of, 66 
dominant, 80, 134-143, 148, 309 
equal, 65, 67-71, 75-77, 83-87, 93, 135, 
137, 138, 140 

non-dominant, 143-145, 331, see also 
Overtones 

of Hermitian matrix, 155 
of matrices connected by collineatory 
transformation, 69 
of polynomial of a square matrix, 69 
of symmetrical matrix, 155 
situation of, 155 

Leading matrix coefficient, of a lambda- 
matrix, 57 
Line matrix, 2 
Linear algebraic equations, 

expressed as matrix equation, 8, 18 
solution of, 27, 96-97, 125-133 
solution of, by direct operation on rows, 
130-131 

solution of, by iterative methods, 132-133 
solution of, by post-multiplication, 126- 
130 

Linear difference equation, 73, 148 
Linear differential equations, see Linear 
ordinary differential equations. 
Linear partial differential equations 
Linear dynamical equations, 
confluent special solution of, 198-200 
direct solution of, 191-192, 194, 195, 
295 

for conservative system disturbed from 
equilibrium, 281-282, 291-300, 308 
for conservative system disturbed from 
steady motion, 282-283 
iterative solutions of, 308-331 
notation and terminology for, 288-289 
reduced to first-order system, 164, 289, 
327 

reducing variables for, 195 
special solution of, 197 
transformation of coordinates in, 282 
Linear factors, 92 
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linear ordinary differential equations with 
constant coefficients, 
adjoint matrices for first-order system of, 
205 

boundary problems with, 186 
change of order of boundary conditions 
for, 190-191 

characteristic numbers for, 187-188 
complementary function of, 157, 178-182 
confluent special solution of, 198-200 
constituent solutions of, 167-175 
conversion of, to first-order system, 162- 
165, 202 

“diagonal” systems of, 160 
direct solution of, for one-point boundary 
problem, 191-195, 207-209, 295 
direct solution of, for two-point boundary 
problem, 201 

equivalent systems of, 158, 159, 161 
fundamental theorem for systems of, 161 
general solution of simple first-order 
. system of, 206-207 
Heaviside’s method for, 203-204 
homogeneous and non-homogeneous sys¬ 
tems of, 157 

method of isolation for, 197 
notation for one-point boundary prob¬ 
lems with, 188-189 

notation for two-point boundary prob¬ 
lems with, 200-201 

notation for systems of, 156-157,178-180 
order of, 166 

particular integral of, 167, 183-185, 196, 
209 

simple first-order system of, 202,209-211 
solution of, when A(A) vanishes iden¬ 
tically, 181-182 

special solution of, for standard one-point 
boundary problems, 195-200 
special solution of general first-order 
system of, 203-205 

special solution of simple first-order 
system of, 207 

standard one-point boundary problems 
with, 189, 190 

transformation of dependent variables in, 
159-160 

triangular systems of, 160-162 
Linear ordinary differential equations with 
variable coefficients, 

characteristic numbers for, 224, 226-228, 
230-231 

continuation formula for systems of, 

219- 222 

existence theorems for, 212-214 
first-order systems of, 213-214, 216-218, 

220- 224, 232-233 

fundamental solutions of, 214-215, 219, 
220, 223, 224 


indicial equation for, 215 
irregular singularities of, 215 
notation for system of, 215 
ordinary points of, 212 
Peano-Baker method for first-order 
system of, 217-218 

power series solution for first-order 
system of, 222-224 
regular singularities of, 215 
simple first-order system of, 216,217-218, 
220-222, 232-233 
singularities of, 212-215, 221 
solution of, by collocation and Galerkin’s 
method, 224-231 

solution of, by mean coefficients, 232-245 
solution of, by method of Frobenius, 215 
system of, reduced to first order, 215-217 
Linear partial differential equations, 
change of variables in, 50-62 
methods of collocation and Galerkin for, 
227 

Linear substitution, 26-27, 64 
Linear transformation, 
defined, 26 
matrix of, 27 
triangular, 31 

Linear vector function, 268 

Loading, principal directions of, 265-266 

Mass-balance of aeroplane control surfaces, 
302 

Matrices, 

addition and subtraction of, 4 
conformable, 6, 9, 14, 26 
conjugate, 33 

continued products of, 9-12, 221 
differentiation of, 43-52 
division of, 22, 58-60 
elementary operations on, 87-89,90,96,97 
equal, 4 

equivalent, 89, 90-92 
fractional powers of, 38, 81, 82 
infinite series of, 40-41, 53, 81 
integration of, 52-56 
inverse, 22, see also Reciprocal,Reciproca¬ 
tion 

multiplication of, 6-12 
notation for, 1-3 

of differential operators, 46-51, 156 
partitioned, 13-15 
permutable, 6, 7, 42, 44 
predivision and postdivision of, 22 
products of, 6-12 

reciprocal, 22, see also Reciprocal, Reci¬ 
procation 

representing complex scalars and quater¬ 
nions, 35-36 

representing finite rotations, 246-249, 
251-254 
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Matrices (cont.) 

Taylor’s theorem for, 44-45 
which commute, 6, 39, 42 
with null product, 23-24 
Matrix, 

adjoint, 21, see also Adjoint matrix 

aerodynamical stiffness, 284, 285 

alternate, 3, 33 

characteristic, 64 

column, 2 

D-, 157 

damping, 284, 288, 385 
definition of, 1 

degeneracy of a square, 18, see also 
Degeneracy 
derived, 43-44 
determinant of a square, 16 
diagonal, 3, 12-13, 23, 66, 89, 91, 93 
dynamical, 308, 309, 310 
exponential function of a square, 41-43, 
45-46, 209-211, 221, 222, 232 
flexibility, 265, 309 
Hermitian, 33, 155 
idempotent, 79 
inertia, 284, 288 
integrated, 62-56 

lambda-, 67, see also Lambda-matrices. 
Lambda-matrix 
line, 2 

modal, 66, 179, see also Modal matrix 
multiply degenerate, 18-20, 23, 61, 62, 
65, 67, 70, 85 
non-singular, 18 
null, 3, 23 
order of, 2 

orthogonal, 33, 34, 35, 247, 251 
of a linear transformation, 27 
of direction cosines, 34, 247, 251 
of transformation, orthogonal, 247, 251, 
255 

polynomial of a square, 39-40, 45, 67, 69, 
78-80, 83-87 

powers of a square, 10, 37-38, 67, 72, 73, 
80, 133, 145 

rank of a square, 18, 23, 57, 89-90, 
91 

rectangular, 1 
resolvent of a square, 78 
row, 2 
scalar, 13 

simply degenerate, 18, 61, 65, 71, 86 
singular, 18 
skew, 3 

skew symmetric, 3, 26, 33 
square, 3 

stiffness, 264, 284, 288, 308 
Sylvester’s expansion for a square, 86 
symmetrical, 3, 26, 33-34, 77 
total stiffness, 284 


transposed, 3, 25 
triangular, 97-106 
unit, 3, 13, 35 

Matrizant, 53-56, 218-219, 222, 232 
Mean coefficients, method of, 232-245 
Membrane, vibrating annular, 230-231, 
243-245, 320-322 
Minor determinants, 16 
Mittag-Leffler star, 52 
Modal coefficient, 179 
Modal columns, 

calculated by iterative methods, 141-145, 
150, 151, 308-310 

connected with the characteristic matrix, 
64-69 

defined, 64, 179 

for general linear dynamical system, 288- 
290, 327, 329, 330 

for linear conservative system, 199, 291- 
295, 299,300,308-310, 312-314,324, 
325 

for simple first-order system, 206 
for systems with solid friction, 336 
orthogonal properties of, for conservative 
systems, 299, 300 
properties of, 77 

relations between, for dissipative systems, 
301 

Modal matrix, 
defined, 66, 179 

for linear dynamical system, 289 
for simple first-order system, 206 
for systems with solid friction, 336 
of dynamical matrix, 298 
of polynomial of a matrix, 69 
of symmetrical matrix, 77, 259, 266 
reciprocal of, 77, 86 
Modal row, 336 
Moments of inertia, 
generalised, 281, 284 
principal, 257-258 
Momentum, 
angular, 257 
generalised, 274-276 

Morris, method of, for solution of linear 
algebraic equations, 132 
Moving axes, 

angular velocities of, 248, 253, 255 
angular momenta referred to, 257 
carried positions of, 250 
defined, 246 

in three dimensions, 260-256 
in two dimensions, 247-249 
kinetic energy expressed for, 257 
Lagrange’s equations referred to, 277-279 
velocities and accelerations referred to, 
248, 256 

Multi-cylinder engine, torsional oscillations 
of, 316-318 
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Multiplication, 
of lambda-matrices, 58 
of matrices, rules for, 6 
of partitioned matrices, 14 
Multiply degenerate matrices, 
adjoints of, 21, 62, 63, 65, 123, 125, 173, 
174 

corresponding to multiple latent roots, 
66, 67, 69, 70, 71, 85, 87 
corresponding to multiple roots of deter- 
minantal equation, 61, 63 
defined, 18 

expressed as products, 20 
products of, 23 

National Physical Laboratory, experiments 
at the, 332, 333, 382, 395-398 
Non-holonomous systems, 260 
Normal coordinates, 295-298 
Null 

matrices, 3 

product, square matrices with, 23-24 
Nullity, 
defined, 18 
Sylvester’s law of, 23 

One-point boundary problem, 
confluent special solution for standard, 
198-200 
defined, 186 

direct solution of, 191-195, 207-209, 295 
notation for, 188-189 
order of conditions for, 189, 190-191 
power series solution of, 209-211 
special solution for standard, 195-197, 
203-206, 207 
standard, 189-190 
One-sided constraint, 260 
One-signed quadratic forms, 30 
Operational formulae connected with partial 
fractions, 177-178, 184, 190, 196, 
203, 204 
Operators, 

matrices of differential, 46-51, 156 
selective, 82 
Order, 
of matrix, 2 

of one-point boundary conditions, 189, 
190-191 

of system of linear differential equations, 
156 

Ordinary differential equations, linear, 
see Linear ordinary differential equa¬ 
tions 

Ordinary points of linear differential equa¬ 
tions, 212 
Orthogonal 

matrix, 33, 34, 35, 247, 251 
property, 77, 259, 299, 300 


Oscillations, 
bounded, 333, 334 
decaying, 333, 334, 359, 361 
flexural, of tapered beam, 318-320 
forced, of aerodynamical systems, 302- 
307 

of aeroplane tail, see Aeroplane tail 
of aeroplane wing, 266, 305-307, 328-331 
of beams, 309, 314-316, 318-320 
of systems with solid friction, see Solid 
friction 

of triple pendulum, 310-314 
pitching, of an aerofoil, 382-398 
resonant forced, 183, 305 
spasmodic, 332, 333 
steady, see Steady oscillations 
torsional, of multi-cylinder engine, 316- 
318 

torsional, of uniform cantilever, 314-316 
transverse, of annular membrane, 230- 
231, 243-245, 320-322 
transverse, of stretched string, 188 
unbounded, 333, 334, see also Flutter 
Oscillatory instability, conditions for, 291 
Overtones, calculation of, by iterative 
methods, 309, 312-314, 325, 330-331 

Parameters, critical, 291 
Partial 

differential equations, linear, 60-51, 227 
differential operators, matrices of, 47-61 
fractions, matrix formulae dependent on, 
78, 83, 175-178, 184, 190, 196, 203, 
204 

Particular integral, 157, 183-185, 196, 209 
Partitioning, 

of bilinear and quadratic forms, 29-30,272 
of matrices, 13-15 

Peano-Baker method of integration, 216, 
217-218 

Pendulum, oscillations of a triple, 310-314 
Permutable matrices, 6, 7, 42, 44 
Pitching oscillations of an aerofoil, 
conditions for steady frictional, 387-394 
derivatives appropriate to, 385-386, 391 
design of apparatus for, 384r-394 
experiments on, 382-384, 395-398 
Point, or column matrix, 2 
Polynomial of a square matrix, 

constructed by collineatory transforma¬ 
tion, 67 
defined, 39 

evaluated by Sylvester’s theorem, 78-80, 
83-87 

factorisation of, 39, 40 
latent roots and modal matrix of, 69 
reduction of, by Cayley-H&milton 
theorem, 72 

Taylor’s theorem for, 44 - 4 5 
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Positional coordinates, of a rigid body, 259, 
see also Angular coordinates, General¬ 
ised coordinates 

Positive definite quadratic form, 30 
Postdivision, 22 
Postmultiplication, 6 
Potential, 

centrifugal, 277 
kinetic, 271 

Potential energy, 263, 264, 271, 326 
Powers of a square matrix, 

constructed by collineatory transforma¬ 
tion, 67 
defined, 10, 37 
fractional, 38, 81, 82 
high, 80, 133-145 

linear difference equation satisfied by, 73, 
148 

reduction of, by Cayley-Hamilton 
theorem, 72 

upper bounds for moduli of elements in, 
41, 43, 145-147 
Predivision, 22 
Premultiplication, 6 
Prime, or row matrix, 2 
Principal 
axes, 257-258 
coordinates, 295-298 
diagonal, 3 

directions of loading, 265-266 
Product chain, or continued product, of 
matrices, 9-12, 221 
Products of inertia, 

elimination of, by conversion to principal 
axes, 257 

generalised, 281, 284 
Products of matrices, 
abbreviated rules for, 8 
continued, 9-12, 221 
defined, 6 
Pure strain, 259 

Quadratic differential operators, 49-52 
Quadratic form, 
defined, 28 
differentiation of, 48 
discriminants of, 30-33, 272 
one-signed, 30 
partitioned, 29-30, 272 
positive definite, 30 
Quadric, axes of a central, 258 
Quaternions, represented by matrices, 35-36 

Bank, 

of a lambda-matrix, 57, 91, 182 
of a square matrix, 18, 23, 89-90 
of equivalent matrices, 89-90 
Rayleigh’s principle, 

for conservative systems, 299-300,310,315 
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generalised for special dissipative sys¬ 
tems, 300-301 
Reciprocal, 

improvement of an approximate, 120- 
121 

of lambda-matrix, 58 
of matrix, 22 
of modal matrix, 77, 86 
of skew symmetric matrix, 26 
of symmetrical matrix, 26 
of triangular matrix, 103-106 
Reciprocal 

property of a dynamical system, 276 
theorem for conservative systems, 265 
Reciprocation, 

by direct operation on rows, 119-120 
by method of postmultipliers, 109-112 
by method of submatrices, 112-118 
reversal rule for, 25 
Rectangular matrix, 1 
Reduced characteristic function, 70, 72 
Reducing variables, 
defined, 195 

displacements, velocities, and accelera¬ 
tions expressed in terms of, 336, 337, 
340, 379 

for system with solid friction, 336, 337 
recurrence relations for, 341, 342 
Reference axes, see Axes, Frame of reference 
Regular singularity, 215 
Remainder theorems for lambda-matrices 
59, 60, 70 

Resolvent, of a square matrix, 78 
Resonance, 183, 305 
Reversal of order in products, 25 
Rodrigues* formula, 254 
Roots, 

latent, 64-87, see also Latent roots 
of algebraic equations, see Algebraic 
equations of general degree. Linear 
algebraic equations 

of determinantal equation, see Deter- 
minantal equation 
of matrices, 38, 81, 82 
Rotations, 

convention regarding sign of, 247 
matrices representing finite, 248-249, 
251-254 

Routh’s test functions for stability, 154 
Row matrix, 2 

Rudder-fuselage flutter, see Tail flutter 
Scalar 

equations, expressed as matrix equation, 
5 

matrix, 13 
multipliers, 4, 5 
Segre characteristic, 94 
Selective operators, 82 
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Semi-period equation, for steady frictional 
oscillations, 346, 348-360, 364, 370, 
393 

Semi-rigid 
body, 266 
cantilever, 266 
wing, 266, 306 

Series of matrices, infinite, 40-41, 63, 81 
Simple first-order system of differential 
equations, 

adjoint matrices for, 206 
continuation formula for, 220-222 
defined, 202, 216 
determinantal equation for, 206 
direct solution of, 207, 208 
general solution of, 206 
modal columns for, 206 
power series solution of, 209-211 
special solution of, 207 
Simply degenerate matrices, 
adjoints of, 21, 61, 62, 63, 66, 68, 76, 123, 
124 

corresponding to simple or multiple 
latent roots, 65, 66, 68, 71, 86 
corresponding to simple or multiple roots 
of determinantal equation, 61, 62, 63 
defined, 18 

Singularities, of linear differential equations, 
212-215, 221 
Singular matrices, 
defined, 18 

expressed as products, 20 
see also Multiply degenerate matrices, 
Simply degenerate matrices 

Skew 
matrix, 3 

symmetric matrix, 3, 26, 33 
Slug, definition of, 387 
Small motions, 

construction of equations of, 280-281 
of aerodynamical systems, 283-284 
Smith's canonical form for lambda-matrices, 
91-92, 181 

Smith's transformation, for an operational 
row, 60 
Solid friction, 

dynamical equations for systems with, 
336-345 

graphical method for systems with, 364- 
357, 372-380 

influence of, on critical speed for flutter, 
362 

influence of, on oscillations of systems, 
332, 333 

measurements of, 395 
oscillations of aeroplane tails with, 359- 
381 

pitohing oscillations of an aerofoil con¬ 
strained by, 382-398 


static and dynamic, 332 
steady oscillations of systems with, see 
Steady oscillations 

system with single degree of freedom and, 
342-344 

Spasmodic oscillations, 332, 333 
Special solution, 
confluent form of, 198-200 
for general first-order system, 203-205 
for simple first-order system, 207 
for standard one-point boundary pro¬ 
blems, 196-197 
Spectral set, of operators, 82 
Sphere, rolling on a fixed plane, 260 
Spiral diagrams, for graphical discussion of 
friction problems, 366-367, 374 
Square matrix, definition of, 3 
Stability, 

of an aeroplane, 285 
of an aeroplane wing, 266 
of equilibrium of a conservative system, 
264 

of steady frictional oscillations, 360-353, 
360, 361, 362, 382 

test functions and determinants for, 154- 
155, 291, 388, 389 

Standard one-point boundary problem, 
defined, 189 

special solution for, 195-200, 203-205, 
207 

Starting instant, 335 
Steady motion, 

aeroplane disturbed from, 284-287 
conservative system with ignorable 
coordinates disturbed from, 282-283 
disturbance from, 280 
of an aeroplane, 268 
Steady oscillations, 

conditions for, when only one coordinate 
is frictionally constrained, 345-350 
graphical treatment of conditions for, 
387-389 

of aerofoil with solid friction, 382, 387- 
397 

of aeroplane tails with solid friction, 360- 
372, 375 

semi-period equation for, 346, 348-360, 
364, 370, 393 

stability of, 350-353, 360, 361, 362, 382 
treatment of conditions for, based on 
inertias, 389-392 
Stiffness, 

aerodynamical, 264, 284, 385 

centrifugal, 278, 279 

cross, 284 

direct, 284 

direct elastic, 281 

elastic, 264 

elastic cross, 281 
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Stiffness 

coefficient, 264, 338 
line, 388, 389 
matrix, 264, 284, 288, 308 
point, 388 

Stopping instant, 335 
Strain, 

homogeneous, 258 
pure, 259 

String, oscillations of a stretched, 188 
Structural problems, linear algebraic equa¬ 
tions in, 132 

Sturm’s functions, 152-153 
Subdominant latent roots, calculation of, 
143-144, 151 
Submatrices, 

bilinear and quadratic forms expressed 
by, 29-30, 272 
defined, 13-15 
of simple classical type, 94 
reciprocation by method of, 112-118 
Subproducts, use of, in calculation of con¬ 
tinued products, 11 
Subsidence, 290 
Substitution, linear, 26-27, 64 
Summation, 

of infinite series of matrices, 41, 81 
of matrices, 4 
Superdiagonal elements, 93 
Sylvester’s expansion, 

coefficients in, calculated by iterative 
methods, 138-141, 310 
confluent form of, 83-87, 134 
constructed, 78-79 

dominant latent roots calculated by, 80, 
133-138 

fractional powers of a matrix obtained by, 
81 

infinite series of matrices summed by, 
81 

properties of coefficients in, 79, 82 
Sylvester’s law of degeneracy, 23, 24 
Symmetrical 
determinants, 34 
matrices, products of, 26, 33 
Symmetrical matrix, 
adjoint of, 34 
defined, 3 
latent roots of, 155 
modal matrix of, 77, 259, 265 
reciprocal of, 26 

Tail flutter, 

critical speed for, 359, 363 
damping and frequency diagrams relating 
to, 359, 361 

influence of solid friction on, 360, 362 
wind tunnel experiments on, 332, 333, 
361 


Tail oscillations, 

friction theory applied to, 359-381 
steady, 360-372, 375, 376, 380 
Taylor’s theorem for matrices, 44-45 
Tensor, 36 
Terminal 
equations, 342 
instant, 335 
Test 

conic, 387-389 

determinants and functions for stability, 
154-155, 291, 388, 389 
Torsional oscillations, 

of aeroplane fuselage, see Tail oscillations 
of multi-cylinder engine, 316-318 
of uniform cantilever, 314-316 
Total stiffness matrix, 284 
Transformation, 

collinoatory, 66, 69, 93, 94, 206 
congruent, 29, 31 
linear, 26-27, 64 
matrix of a linear, 27 
of generalised coordinates, 282 
orthogonal matrix of, 247, 251 
Smith’s, for an operational row, 50 
Transposition of matrices, 3, 25 
Transverse oscillations, 

of an annular membrane, 230-231, 243- 
245, 320-322 
of a stretched string, 188 
Triangular matrix, 
defined, 97 

reciprocal of, 103-106 
reduced to diagonal form, 102-103 
reduction of non-singular matrix to, 97- 
102 

Triangular systems, of linear ordinary 
differential equations, 160-162 
Triple pendulum, oscillations of, 310-314 
Twist of an aeroplane wing, static, 325-327 
Two-point boundary problem, 
defined, 186 
direct solution of, 201 
notation for, 200-201 
power series solution of, 211 

Unbounded oscillations, 333, 334 
Unit matrix, 3, 13, 35 
Upper bounds, 

for an integrated matrix, 52, 55 
for powers of a matrix, 41, 43, 145-147 
Up-stroke, 335 

Vector, 

components of, referred to moving axes, 
247-248, 251, 256 
of first kind, or row matrix, 2 
of second kind, or column matrix, 2 
Vector function, linear, 258 
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Vibrations, see Oscillations 
Virtual work, 262-264, 270, 277 

Wind tunnel experiments, 
on pitohing oscillations of aerofoil, 382, 
396-698 

on tail flutter, 332, 333, 361 
Wing, 

oritical speed for flutter of, 302, 307 


free oscillations of, 266, 328-331 
forced oscillations o£ 305-307 
semi-rigid, 266, 305 
stability of, 266 
static twist of, 326-327 
Work, virtual, 262-264, 270, 277 

Yaw, angular displacements in, 268, 332, 
333 




